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The purpose of this research was to study 
expectations of elementary school children in two vayss 
experisentally and observationally. Expectations say be roughly 
defined as a child's ideas of his own ability at a particular task. 
Pros the data it appears that childrens* expectations could be raised 
experimentally by a suitable adult and high expectations in one area 
generalize into other unrelated areas. The experiments are summarized 
in a number of published articles reproduced herein and listed in the 
bibliography. The observational data focus on children in first and 
second grades in a white middle-class school and in an integrated 
lower class school. From the time they enter school individual 
children are followed to see how their expectations for their own 
performance in reading, arithmetic, and conduct develop. Their 
expectations, and their parents* expectations for them, are 
repeatedly measured. Children in both places have, on the average, 
very high expectations for themselves before they get a report card, 
higher than their parents. These expectations do not diminish much 
when marks are lower than er.pected; in fact for the majority, 
expectations are maintained over first grade. Children whose marks 
improve are likely to be those whose expectations exceeded marks. 
(Author) 
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Abstraot'i^ 



The purpose of this research was to study expectations of 
elementary school children In two ways: experimentally (Section l)» 
and observatlonally (Section II). Expectations may be roughly defined 
as a child's Ideas of his own ability at a particular task. 

The experiments focus on (1) how children's expectations for their 
own performance may be raised, (2) what kinds of persons can raise 
^children's expectations. (3) what kinds of children etre susceptible to 
expectation raising, {k) whether children's expectations generalize 
from one area to another, {3) whether children express behavlorally 
their expectations for other children, and (6) what kinds of data 
shape children's expectations for other children. Children's expecta- 
tions cw be raised experimentally by a suitable adult and hi^ expecta- 
tions in one area generalize into other unrelated areas. The experiments 
are summarized In a number of published articles reproduced herein and 
listed in the bibliography. 

The observational data focus on children in first and second grades 
in a white middle-class school and in an integrated lower class school. 
From the time they enter school individual children are followed to see 
how their expectations for their own performance in reading, arithmetic » 
and conduct develop. Their expectations, and their parents' expecta- 
tions for them are repeatedly measured. Children in both places have, 
on the average, very high expectations for themselves before they get 
a report card, higher than their parents. These expectations do not 
diminish much when marks are lower than expected, in fact for the 
majority, expectations are maintained over first grade. Children whose 
marks improve are likely to be those whose expectations exceeded marks. 



* The reader should also see the "Ovex'vlew", p. xlv. 
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HhiB research has been conducted over a S-year period. It had its 
genesis in work done even earlier, starting with ei^loratory studies 
in 1969* There is a clear intellectual debt owed to Joseph Berger and 
his associates at Stanford University who have fonoulated and tested 
Expectations States Theory in the laboratory. T:.i a very real way our 
research represents the extension of that research into field settings. 
The theory led us to work directly with children's es^ectations. It 
also led us to some observational work, to see what events in young 
children's academic careers lead them to fomulate hi^ (or low) 
esQectations for themselves. 

Like most research, that reported here probably raises more 
questions than it answers. That it can answer any is due in a very 
real sense to the cooperation and patience of teachers and children 
in schools in both Baltimore County and Baltimore City. Where possible, 
individual principals and others are identified in reports reproduced 
for inclusion in this Final Report. In the case of Section II, which 
covers the observational work, confidentiality requires that we do not 
identify the particular schools involved in the study. We acknowledge 
here the splendid cooperation we have received from these schools, and 
regret that they must remain emonymous even here. 
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OVERVm 



This final report is divided into two sections, Section I: 
Eiq^erimental Studies of Expectations and Section II: Observational 
Studies of Expectations. Each section deals with a series of studies 
on children's expectations. 

Section I is devoted to experimental studies and consists mainly 
of a series of papers which present and summarize research supported 
by this grant. These will be numbered serially to facilitate class- 
ifying. Three papers have already appeared in print, (l) "Raising 
Children's Performance Expectations," Social Science Research, 1972, 
1 (June): 1U7-158, (2) "Research Notes: Status Factors in Expectation 
Raising." Sociology of Education . 1973, h6 (Winter): 115-126, (3) 
"Raising Children's Esqpectations for Their. Own Performance: A Class- 
room Application, " Chapter 7 in Expectation States Theory . J. Berger, 
T. Conner, & M. H. Fisek (Eds.), Cambridge: Winthrop Publishers, Inc., 
197^. (h) "Expectations in Mixed Racial Groups" will appear in the 
summer in Sociology of Education , and (3) "Expectation Effects on 
Performance Evaluation" has been submitted for publication. Others 
("Teacher Expectancies and Student Expectation States," Pacific 
Sociological Association, April 1972; "Raising E33)ectatlons in the 
Clarssroom," Eastern Sociological Association, April 1972; "Status 
Factors in Expectation Raising," American Sociological Association, 
August 1972; "Expectations in Mixed Racial Groups," American Educational 
Research Association, February 1973; "Development of Expectations in 
Mi::ed Racial Groups," American Psychological Association, September 1974) 
have been presented at professional meetings and overlap to seme degree 
the papers in print. 

These papers have appeared throughout the term of this research 
project, and each summarized a particular series of field experiments 
with grade school children. The experiments focus on (1) how children's 
expectations for their own p«irforaance may be raised, (2) what kinds 
of persons can raise children's expectations, (3) what kinds of children 
are susceptible to expectation raising, {h) whether children's expectations 
generalize from one area to another, (5) whether children express 
behaviorally their expectations for other children, and (6) what kinds 
of data shape children's expectations for other children. All of the 
experiments are concerned directly with children's espectations. 

The first stage of the exgperlmental research was to analyze class- 
room problem solving interaction from a theoretical perspective which 
defined the relevant variables and predicted their interrelations. 
Experiments then were conducted to demonstrate the utility of our 
theoretical analysis; to show, that is, that we could predict the 
effects of specific types of experimental Intervention in naturalistic 
classroom studies. Each exqperlment focuses wither on a significant 
link in the theoretical process, or upon elaborating subpopulation 
differences revealed In a previous experiment. 



ERIC 



xlv 



J 6 



Section IX Is devoted to observational studies and presents 
longitudinal data on expectations of children in three cohorts: (1) 
white middle-class children who entered first grade in 1971; (2) white 
middle-class children who entered first grade in 1972; (3) white and 
black lower-class children who entered first grade in 1972. From the 
time they entered school these children have been followed to see how 
their es^ectations for their own performance develop and change. The 
children's expectations for their marks in three subject areas 
(Reading, Arithmetic, and Conduct) have been tabulated twice each 
academic year (before the first report card in the fall term and Just 
before the end of the spring term). Ih addition other data character- 
izing the children (iQ's, marks in Reading, Arithmetic, Conduct, 
self-esteem test scores, and the like) have been studies over tine 
to see how children's expectations relate to them (e.g. in the case of 
IQ), or how children's expectations respond to them (e.g. in the case 
of marks). 

The child has been studied within his social network. Parents' 
expectations, some teacher forecasts for achievement, and the child's 
socicmetric standing in his own class have been studied in relat.ton 
to the child's expectation level. Besides portraying individual 
children and cohorts over time, the observational data can be 
aggregated by cohort, or in the case of the two middle-class cohorts, 
across cohorts as well for the first grade year. (Further work is 
planned continuing with these cohorts until the end of third grade.) 
The aggregation permits examination of a number of important problems, 
for exaiqple, whether male and female children are the targets of 
differential expectations by parents for achievement in reading or 
arithmetic. Or, whether black children and white children have different 
expectation levels or have different patterns of e3Q>ectatlo& change 
over time. 

Hie work is novel in several respects. E3Q>ectations of children 
in the first four grades of school have not previously been studied. 
Cohen and her associates have carried out some experiments with Junior 
high students (especially blacks), and Brookover and his associates 
have carried out a longitudinal study of students' "self-conception of 
ability" lasting from the 7th to the 12th grade which included some 
experimental phases. But other than these two efforts, there are no 
studies aimed at goals similar to the goals of ours, and both of these 
cited studies concern much older children. One need not labor here the 
difficulties Involved in working with children below the age when they 
can respond easily in writing using group procedures, or when they are 
easily interviewed. But a major novelty in our work springs froa the 
tender age of the subjects. 

Another novel feature is that the actual process of expectation 
formation has not been studied previously. Expectations for school 
performance, or related variables like aspiration level and locus of 
control, are often studied in Junior high or high school age subjects 
but no one has ever Inquired aa to how these expectations are initially 
established , in previous research expectations are assessed which 
already exist at the time a study begins. iRils is important for two 

er|c 



reasons. First, there is considerable sentiment that by third grade 
(the terminal level for most of our work) children's performance levels 
are established and will not change much for the remainder of their 
school career (Kraus, 1973; Husen, 1969; Bloom, 1964). Thus instead 
of viewing expectations as a characteristic which has potential for 
change, expectations are viewed as more or less fixed, like IQ. Second, 
expectations in their early stages of formation lead to feedback loops- 
expectations determine performance which then determine expectations. 
These loops may become so functionally autonomous as to defy inter- 
vention in later childhood. 

Still another novel aspect is the comparison by social class 
level of the effectiveness of various kinds of agents as expectation 
raisers and the impact of feedback from different persons on expectation 
level. To our knowledge no one before has looked into how racial mis- 
match between teachers and primary age children may affect performance, 
and there are few data on how social class mismatch between teachers 
and children affects feedback on performance. In one set of experiments 
{k) we find that black adlults are relatively ineffective at raising 
expectations of white lower-class children although they are effective 
with black lower-class or white middle-class children. We also find 
in the observational study that the range of marks used by first-grade 
teachers is very different between the middle-class and the lower-class 
schools. Feedback in the way of report cards inplies rather different 
things to children in these two locales. 

A still further source of novelty is that, with the exception of 
the Brookover and Cohen projects refered to above, work directly with 
children's expectations , rather than indirectly with teachers' 
expectations, has not been the topic of research in "expectancy" studies. 
A shower of recent studies concern teacher expectancy or Pygmalion effects. 
These have two aims: (1) demonstrating that expectations teachers hold 
for pupils shape pupil performance, or (2) pinpointing teacher behaviors 
that mediate the expectancy effects. 

A number of studies give teachers false information that some 
children are "gifted" or "late bloomers" when in fact the students so 
designated are randomly selected. Then the children's subsequent 
performance is evaluated. A review of the statistical evidence on the 
Pygmalion effect, given in Rosenthal and Rubin (1971), suggests that 
about 3S^ of such maneuvers are successful. To explain unsuccessful 
maneuvers one can surmise that In some cases teachers pay no heed to 
the "information" given them or may forget it. Also apparently teachers 
act out their expectations in different ways, determined at least in 
part by the characteristics of the students with whom they are inter- 
acting. Our experimental work goes far toward elucidating how the actual 
process of how teacher expectations affect performance. This is discussed 
in detail in both (3) and (4). 

Studies that seek to understand exactly how teachers behave to cause 
"high-expectancy" . "»ils to perform better are less numerous but more 
iiUportant in terms Ji" implications for practice. The Brophy and Good (1970) 
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research, for example, reveals that teachers demand more from students 
for whom they hold high expectations. Closely related are "ex post 
facto" naturalistic studies like the recent one (Seaver, 1973) showing 
that younger siblings are rewarded to some extent on the basis of 
teachers' prior experience with their older brothers and sisters, or 
the Palardy (1969) study showing that those teachers who es^ect boys 
to be slower readers than girls have such boys by the end of first 
grade. 

Anyone would suppose, however, to operate on students* expectations 
directly is the most effective way to mobilize expectations toward 
educational goals. A severe limitation to the Pygmalion work, if its 
implications are to be followed in applied work, is that deception Is 
involved. To tell teachers that some students are "gifted" may be 
defensible to demonstrate an effect, as a tour de force , if you will. 
But to give teachers false or misleading information about pupils is 
hardly a practice that can be counteneuiced in everyday affairs. To 
tell teachers that some students are "gifted" furthermore Implies 
"more gifted than others," or that others are not gifted. This negative 
implication we have never seen explicitly mentioned, although one must 
presume, since attention and reinforcement are distributed, that the 
pupil's falsely designated as non-gifted are perhaps denied the attention 
they need to flourish. The important point, however, is that presumably 
pupils' esgsectations for themselves are shaped by teachers' expectations. 
Tti&t being the case, pupils' e^ectations may be a more potent and 
therefore relevant variable to serve as a research target. 

In an important way our research fuses or e3g;>ands upon a nuinber of 
important methodological approaches. The experimental portion of the 
research took its impetus from tight, carefully controlled, laboratory 
studies where the dependent variable was susceptibility to Influence 
in cases of disagreement. Will an individual, for example, change his 
response to an ambiguous stimulis when confronted with a disagreeing 
alter for whom he previously holds high expectations? E55)ectRtions in 
laboratory work have uniformly been the topic of the e:Q)erimental 
maneuver rather than the dependent variable. Expectations are manipulated. 
Subjects are told fictitiously high scores or given false information 
about alter 's status to they will hold high expectations for him com- 
pared to themselves. 

It is quite another matter to cause expectations to change by 
some sequence of believable events rather than by fictitious assignment. 
Also it is quite another matter to look for a behavior which the subject 
generates sponteuieously, like volunteering to perform, rather them to 
ally responses to an experimenter- imposed task like Judging slides. 
Resisting or accepting influence which is artifically generated is much 
simpler to arrange, and much less like an ordinary life circumstance, 
than is deciding on one's own initiative to perform or to volunteer. 
We do not wish at all to denigrate the laboratory work but we do wish 
to point out that laboratory research on expectation problems, although 
valuable, has constraints that limit its value for predicting or con- 
trolling behavior in true-to- life settings. Expectations for the self 
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or for others arise gradually (vs think) in social comnerce. They 
then affect performance and evaluations, both key variables for future 
expectations and performance. While there may be instances when 
resolution of disagreement is involved, as say when a teacher and 
a student interpret a symbol in a poem differently and the student 
subsequently accepts the teacher's interpretation, this consequence 
of expectations is a relatively minor one from our viewpoint. The 
consequences of expectations in terms of activity closely related 
to learning like performing or volunteering to perform are major 
consequences in our view and the ones we have studied. 

In another way the experiments present a novel methodology. 
All subjects are maintained in a group context. E39ectation8 by 
their nature are partly distributive, awarded by persons as groups 
undertake important tasks. Thus it mak<»s no sense to speak of 
"high expectations" in any absolute sense. One can speak only of 
"esqpectations high with respect to this group" or "expectations low 
with respect to this set of competitors." Failure to preserve 
social context has led to considerable confusion about the self- 
esteem and self-image of blacks or other disadvantaged groups. 
Study after study has appeared demonstrating that minority group 
children usually have as high, or higher self-esteem, than majority 
children, lliis result has been perplexing because it was thought 
that the self- view would reflect the society's view of the minority 
group the self belonged to. Work generally on children's self-esteem 
or self-concept has pretended the individual exists in social 
isolation. The present work treats children nested in social groups. 
Volunteering in our experiments is measured with respect to others 
in the seune group . Likewise in the observational work self-esteem 
or expectations is assessed in terms of the child's position in his 
classro(»n or grade. The iiqportance of the group ccmtext cannot be 
stressed too much. In Cohen's experiments, for example, expectations 
of blacks for their own performance or of whites for blacks' 
performance are low in mixed racial work groups. In these experi- 
ments the blacks are brought from all-black settings. Our experi- 
ments with younger children in an Integrated school occur in a 
mixed racial environment so no change in context occurs. Under 
these conditions blacks and whites both appear to have higjh 
es^ectations for blacks. Different contexts and contextual effects 
cannot be ignored. 

Perhaps the most significant methodological contribution of this 
research is in the observational portion. It combines the life-span 
developmental approach typical of longitudinal studies in psychology 
with the causally complex structural equation approach characteristic 
of so much recent sociological research, especially research in 
occupational or status attainment. 

Many psychological studies over time have the intent of elucidating 
progression of growth, the exact nature of growth curves and the like. 
Oden (1966), for example, recently brought the Terman gifted sample 
up to date. Gifted children were traced through adulthood, their 
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physical and mental health and a host of other variables were observed. 
One cannot say, ho«rever, what factors lead to the superior mental 
health of the gifted. Certain dependent variables are repeatedly 
measured in the same individuals over time) and the status of the 
target group on these variables is cooopared with that of the general 
population. But very little in the way of exact causation can be 
firmly stated from this typical exaiqple of longitudinal studies in 
psychology. 

In recent sociological studies over time, by contrast, one or 
a small number of dependent variables is measured at one point in 
time, with a few independent variables measured at a single prior 
point. For example, Alexander's work (1973)* using data gathered 
earlier by Eckland, examines occupational attainment in 1972 of a 
sajqple of 1-2000 persons who were interviewed in 1953 when they 
were either sophomores or seniors in high school. The 1933 interviews 
assess peer plans, parents' aspirations, family background, and 
father's occupational status. The object is to account for occupa- 
tional attainment of the aggregate saQple in 1970 or the basis of the 
set of variables evaluated in 19^3. There is no interest in predicting 
that a certain individual will follow a particular life course or 
attain a particular status. Rather mobility of the group is measured 
as a function of the prior variables, and causal chains are sought. 

As mentioned, the present observational study draws from both 
these traditions and is, so far as we know, the first attempt to 
combine the two for a single data set. Individual children are 
repeatedly measured over time to see what course their self-expectations 
take (like the life-span psychological studies). But measures of 
parent and peer e3Q)ectation8 are taken to see what effects appear later 
in the child's expectations that can be causually related to variables 
measured earlier (like the causally coi^plex sociological studies). 
Escpectations of black children can be measured, for exaiqple, in order 
to study the impact of earlier feedback from teachers and parents 
and to assign weights to the importance of feedback from various sources. 
These matters will be extensively discussed later in Section II. 

To sum up: IVo sorts of work will be presented, one experimental 
and one observational. They are conceptually closely related. The 
e3q)erimental studies are carefully controlled attempts to raise 
children's expectations under different conditions. The observational 
studies are focused on what events occur naturally to raise (or lower) 
children's expectations. All the work concerns children in the 
earliest school grades. Although escpectations are presumably most 
malleable at this time, no other research has previously been concerned 
with this variable at this age. 
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SECTION 
EXPERBQSfTAL STUDIES 
Doris R. Entwlsle and 



I 

OF EXPECTATIONS 
Murray Webster, Jr. 



Introduction 



The purpose of this section is to describe a sequential series of 
ea^erimental studies of the determinants and consequences of performance 
expectations in classrooms. In line with U.S.O.E. 's goals "to produce 
and disseminate knowledge," we have published, or are preparing to publish, 
reports of all of these studies. Consequently this narrative omits many 
technical details, theoretical qualifications, and unusual or atypical 
cases in the interest of presenting a coherent view of the research pro- 
gram. We have provided references to the more detailed papers, which 
are appended to this report. 

By the time this grant became active in June 1971 » we had already* 
investigated in a preliminary way some of the theoretical issues described 
here, and had conducted some pilot studies of the basic experimental 
design. Although this early work was e^qploratory and many refinements 
in our design and our thinking were necessary before the program produced 
useful research data, the contributions at the early stages were vital. 
NIMH grant No. MH-I8183 and The Johns Hopkins University Center for the 
Social Organization of Schools provided funding of the early stages. 

At every step of the way our investigation is closely linked to 
em explicit theory. By "explicit theory" we mean a set of propositions 
explicitly stated, with a concern for parsimonious, consistent use of 
concepts and logical consistency of the propositions; and some independent 
empirical tests of derivations from those propositions. This feature is 
an unusual one in educational research, and we see it as a major strength 
of our program. Among the advantages of this "theoretically informed 
strategy" are the following. (1) The theory isolates crucial variables 
from incidental features of a situation; for exanple, it tells us to 
focus upon children's escpectations for themselves rather than teachers ' 
expectations for children. (2) "Hie theory gives a way to analyze previous 
findings and points out relations between findings from very different 
concrete settings; for example, it shows similarities between the "Pygmalion" 
studies, our classroom experiments, euid a large number of small group 
laboratory studies. (3) The theory explains some findings and predicts 
others; for exaiqple, in some tf our publications we analyze the "Pygmalion 
effect" and show how both successful and unsuccessful attenqots to produce 
it may be e3cplained--in contrast to the mysterious nature this effect 
assumes when it is mentioned by other researchers. (U) The theory guides 
research by predicting effects of contemplated intervention techniques; 
thus all experiments reported in this section use direct operational- 
izations of the theory. 

Ttie theory which guides these experiments is presented in Webster 
and Sobleszek (197^a, 197Ub) and in Entwisle and Webster (197Ua). 
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Basically, it is concerned with the relations of evaluations of performance 
and ability conceptions. The theory is sketched in the paragraphs which 
follow. 

In the experimental studies our concern was to develop substantively 
important applications of previously confirmed parts of the theory, not 
to test the theory itself. The strategy of applying only what we con- 
sidered reliable derivations enables us to concentrate upon developing 
practical techniques and measures, not upon theoretical development and 
reformulation. In other words, because we used only independently con- 
firmed theory to guide our research, we could concentrate attention upon 
measures, eaqaerimental design, and intervention techniques when we got 
unexpected results; not on the propositions which led to the disconfirmed 
prediction, in a sequential research program. Just knowing where the 
problem lies when things go wrong is exceedingly valuable. 



Expectations in the Classroom 

An elementary school classroom may be viewed as a social situation 
in which evaluations of performance, ability of actors' conceptions of 
their own and each other's abilities, and chances to perform are of 
central interest. Thus the primary relevant components of normal pro- 
blem solving interaction are the following: action opportunities or 
socially distributed chances to perform, such as those the teacher dis- 
tributes to children by calling on them; performance outputs or problem 
solving attempts, such as children make when they offer an answer; 
positive or negative unit evaluations of individual performance outputs, 
which are communicated by the teacher and also by other students. All 
of these, as well as other components not mentioned here in the interest 
of brevity, are distributed among actors as a direct function of the 
es^ectation state associated with each actor. Tkie higher the expectations 
held for a given actor, the more likely he is to receive an action 
opportunity, to accept a given action opportunity and make a performance 
output. Thus, although es^ectation states are themselves unobservable, 
they are theorectically assumed to affect the social distribution of all 
relevant components of task focussed interaction. 

Expectation states are persistent— though not unchangeable— con- 
ceptions of ability which arise as a result of social interaction. It is 
crucial to note that expectations are always relative: an individual 
holds expectations for himself relative to particular other individuals 
and at a particular task. (This is one reason it makes no sense to speak 
of the "low self-esteem of blacks" without specifying conparison others 
and a particular task* ) In fact, it is partially due to this relative 
conceptualization that we are able to raise expectations of children in 
various social situations using a fairly simple intervention technique. 

Expectation states are formed in either of two basic ways, though 
there are variations on each. First, a series of unit evaluations of a 
particular actor's performances will at some point crystallize into an 
expectation state held for him. We assume, in other words, that after 



ERIC 



a series of positive unit evaluations a child will move from saying to 
himself "I think I was right" to "I think I have high ability at this 
task. " At about the same time, of course, other children will make the 
comparable cognitive change: from "l think he's right" to "I think he 
has high ability." The only factor that matters for expectation forma- 
tion is the positive or negative unit evaluations of performance. We 
call this the "basic evaluation-expectation process." 

Second, in the absence of evaluative information relevant to the 
task, actors may form estpectations on the basis of any known evaluated 
characteristic possessed by the individual. The known evaluated char- 
acteristic may be another ability-- such as when a teauher says "Bill is 
pretty good at reading, so he probably can spell we'll"— or it could be 
a status characteristic— such as when a child says "Most blacks are 
dumb." We call this the "spread of relevance process," since a previ- 
ously non-related characteristic becomes relevant to expectations held 
for particular actors in this case. 

Whichever way e^gpectation states for actors come to be held, they 
tend to persist unless acted upon. There are also reasons why esqpecta- 
tion> states tend not to be acted upon or changed. The main reason is 
that once expectation states for actors are formed, they affect the dis- 
tribution of unit evaluations, emd thus act to reinforce the very con- 
ditions which led to their formation. This circular evaluation-expecta- 
tion-evaluation process is what we call its "self -maintaining feature." 

Figure 1 represents the basic expectation-evaluation process, the 
spread of relevance process, and the self-maintaining feature of expectation 
states within the classroom. In interpreting the flow chart, assume 
that influences (represented by arrows) going upwards are weaker in 
effect than those going downwards. (Figure 1 is intended for illustrative 
purposes only; in particular, it should not be regarded as an incoioplete 
path analytic diagram.) In Section II of this Final Report we escpand 
the focus to include sources of expectation effects external to the 
immediate classroom, as well as some attempts to develop quantitative 
estimates of degrees of effect. 

Both for interpreting Figure I and in reading the following des- 
criptions of experiments the reader should bear in mind that (1) we are 
centrally concerned with determinants and consequences of individual 
children's expectation states, and (2) expectations are always relative 
to a particular task and to specific comparison others. 



First E^erimental Series; The Basic Expectation- Raising Eacperiment 

Initial experiments were concerned (l) with demonstrating some crucial 
steps in our theoretical analysis, particularly the way in which children's 
expectations are formed and the importance of the child's eipectations, 
and (2) with developing a means to produce results of practical signifi- 
cance. What we settled on to satisfy the second goal was a behavior 
generally accepted as important in classroom learning, willingness to 
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Figure 1 

Determinants and Consequences of 
Classroom Expectation States 



Evaluated 
Characteristics 




Accepting 
Action Opportunity 



engage in new tasks and problem solving attenpts, or to answer questions 
in class. Such behaviors are labelled "acceptance of an action oppor- 
tunity" in the terminology of e^ectation states tiieory, and they should 
vary directly with the child's (relative) expectations for his ability. 
Also a part of the second goal was to devise a procedure sinple enough 
to tell scHneone else— perhaps a teacher who is not particularly interested 
in experimental design or abstrcust theory— how to produce the same 
practical results. 

The first goal, confirming the correctness of our analysis, was 
tackled by inventing an e^gseriment to raise children's e^gpectations for 
a classroom-like task and setting. High esgpectations, according to the 
theory, can be produced by the basic evaluation- expectation process 
through giving children heavy doses of positive unit evaluations of 
performances. The combination of goals for these experiments iiqposed 
considerable contraints on the experimental task and situation; for 
example: (l) the task must permit a series of discrete, controllable, 
plausible unit evaluations; (2) it must resemble classroom activity, but 
it may hot be a task for which children already have expectations; (3) 
it must not rely on telling either teachers or children something false 
about their abilities; (k) it must be interesting enough to capture the 
children's interest, and (3) it must permit relatively objective measure- 
ments of the independent and dependent variables of the study. 

As the list of criteria suggests, considerable time and effort were 
required in pilot studies to develop a suitable task and design. The 
task finally developed is a modification of a story telling task used 
earlier for research on cognitive development (see Entwisle et al., 1970, 
appended) . The task uses "story skeletons" coi^posed of sentences with 
missing words, which children supply. A ssmple story skeleton is shown 
with supplied words underlined in Figure 2. Tiie esqperiment has three 
phases. In Phase I, groups of children— initially 6, but later groups 
of 1|— are assembled and told the experimenters are looking for children 
who can tell good stories. Stories are to be made up by teams, euid the 
best team stories will be awarded prizes. In each team, the e}Q)erimenter 
will read a sentence, and pause when she comes to a blank. Children 
then raise their hands if they can think of a good word , and one child 
will be called upon to give the "team's word." The more good words, the 
better the story, and the better the team's chances to win. 

Phase I and Phase III are measurement phases: expectation level 
is assessed by noting unobtrusively how often each child raises his or 
her hand. Hand- raising is our measure of relative es^ectation states 
for "story- telling ability" in that particular group. In Phase II, the 
attempt is made to raise one (experimental group) child's expectations 
by having him make up a story all by himself while the other (control 
group) children are in another room. Every single performance is given 
a positive evaluation in Phase II. Ihis procedure, if successful, 
should raise the child's self-expectations. In Phase III the groups 
are reconstituted, experimenters are rotated so they do not know who 
the experimental group child is, and the measurement is repeated. Figure 3 
summarizes the basic expectation raising experiment. 



Figure 2 
Sample of Story-Telling Task 



THERE WAS ONCE A (very tall prince) 

WHO HAD A (castle) 

THAT (HE, SHE) (lived In) 

ONE DAY (HE, SHE) HAD TO GO TO (the dungeon to see his prisoners) 

(HE, SHE) DID THIS VERY (angrily) 

BECAUSE (HE, SHE) WANTED TO (make sure they were there) 

THIS WAS VERY DANGEROUS BECAUSE OF THE (strong prisoners) 

WHO (WHICH) WAS (WERE) VERY (mean) 

IN ORDER TO FOOL THE (FILL IN) OHE (FILL IN) DRESSED UP 
AS A (another prisoner) 

IN SUCH A DISGUISE THE (FILL IN) LOOKED J mean) 

AND WHEN THE (FILL IN) SAW THE (FILL IN), THEY (welcomed him) 

THIS MADE THE (FILL IN) (feel pretty good) 

AND /COMPLETE STORY/ Che let his new friends goj. 
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Data are tabulated by calculating gain scores for children between 
Phase I and Phase HZ. This procedure allows each child to act as his 
own control for such extraneous sources of change as exoltment, growing 
familiarity with the situation, etc. 

The first experiments were performed with children from two very 
different settings, two black inner-city schools and two white rural 
schools. Children were third and fourth graders, both sexes (n a 272). 
In all groups, the meeui gain score of experimental group children 
exceeded mean gain for control group children. l!hus we concluded that 
we had some success at meeting our initial goals. However, the pro- 
cedure was less successful with the black children than with the white, 
and was less successful with fourth graders than with third graders. 
(For detailed references see Entwisle and Webster, 1972, 1973 
(all appended). These reports constitute the exploratory segments of 
the project.) 



Second Experimental Series 

The second experimental series was designed to explore the generality 
across sociological subgroups of the experimental findings that children's 
expectations could be raised at classroom- like tasks. Data were gathered 
from children in grades one through four of a white suburban school. 
Comparison of data from the first and second experimental series permits 
us to assess the effects of age and residential locus, as well as race. 
•The total number of children in the second series is 38I. 

Results again Indicatec. success at expectation raising. For all 
four grades of the suburban school, comparisons with earlier data showed 
no significant effects by grade or residential locus, and no interactions 
Involving any of these three factors. 

These data together with the first experimental series provide 
strong evidence for the generality of the expectation raising effect, and 
for wide applicability of the experimental maneuver devised. Although 
not all possible groups of children have been studied (for example, no 
black middle class children have so far been included), the sociological 
range is great enough for us to conclude that the phenomenon is a general 
one. (See for detailed references Entwisle and Webster, 1973.) 



Third Experimental Series 

The next set of e:q)eriments was directed to some special problems 
which arose during the course of the eeurlier experiments. The third 
series Investigated issues of theoretical analysis, of experimental 
methodology, and of several alternative Interpretations of our results. 

First, we wished to explore reasons for the relative lack of success 
at expectation raising with inner-city black children; especially, fourth 
graders. Although as earlier noted we were able to raise expectations 
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somewhat in this group, the effect was not large enough to attain statis- 
tical significance. We posited two possible reasons for the relative 
lack of success. (1) Attending a black ghetto school is often seen as 
causing children to learn that they will not succe'ecl. In our terms, 
black children receive much negative evaluation, and come to hold low, 
fixed e:Q)ectations for themselves. Our relatively single experimental 
procedure was then simply not sufficient to convince these children that 
they had high ability. We dubbed this the "debilitating effect of school" 
hypothesis. (2) An entirely different line of explanation, suggested by 
Kfttz and his associates (Katz, 1970), is that some black children perceive 
white adults (such as our experimenters) as hostile and Insincere; in our 
terms, perhaps the children did not believe the positive evaluations they 
were given. One might term this the "race-credibility gap." 

To distinguish between these interpretations, we returned to the 
same inner-city school a year later with black experimenters, and repli- 
cated the earlier experiments with black inner-city children. The total 
number of children in these repeat experiments was 76. Results indicated 
that black adults were very successful at raising expectations of black 
inner-city children even though previous attempts with similar children 
by white experimenters had been relatively unsuccessful. From this we 
were able to re;)ect the debilitating effect of school hypothesis, and to 
accept the suggestion contained in Katz's work. We also were sensitized 
to the significance of racial match or mismatch in these experiments, a 
question we pursued at length later. 

Second, we were Interested in the comparative lack of success with 
white suburban girls. We replicated the experiments with third and 
fourth grade girls at a second, equivalent suburban school (n » 6o). 
Again we were unable to produce statistically significant differences 
between gain scores for experimental and control groups, although the 
gain of experimental groups was consistently positive. We tentatively 
concluded that a "contrast effect" may have produced the failure: girls, 
especially middle class suburban girls, are rather accustomed to receiving 
praise and positive evaluation for their work. Our es^erimental technique 
contrasted against this uniformly positive background is not so impressive 
to girls as it is to boys, or to rural or inner city children whose gen- 
eral background is less positive. 

Third, in discussing our work with colleagues, an alternative 
"behavioral modification" interpretation of the effect of the experiments 
was offered. Essentially such an interpretation argues that children in 
the experimental group increase the rate of emission of a behavior which 
is reinforced, rather than raise their expectations. The links to an 
S-K versus a cognitive theoretical stance are obvious. Although we are 
not interested in ruling out every possible competing explanation and 
the ease with which alternative explanations can be invented makes that 
activity fruitless, we felt the behavioral modification Interpretation 
usually does not include any notion of cognitions of the individuals. By 
contrast, it is central to the expectation theory interpretation that a 
cognition — a self-referent idea--get8 changed. We see behaviors as the 
direct consequence of cognitions. 

r. < 
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On analytic grounds, we do not find the bdiavioral modification 
interpretation persuasive: it requires a long chain of unteatable 
assumptions regarding how variables in our experiment are to be inter- 
preted. The experimental children had additional "practice" telling 
a Phase II story, and they had more "personal contact" with the experi- 
menter. Mox« telling however is the fact that the actual behavior 
whose increase is measured~hand-raising— is never reinforced. • We 
evaluate (or reinforce) words spoken , not hanjffalslng activity. Still, 
some simpler variants of the behavioral interpretation could be offered. 

To assess the status of these ideas we conducted some "special 
control" experiments, m Phase II two children were selected: one to 
receive the standard expectation raising treatment, and a second (special 
control) child who made up a story by himself with an experimenter Just 
as the experimental child did, but without any evaluations of his per- 
formances. Insofar as possible, the special control child received 
identical experiences to those received by the experimental child excpet 
the crucial one, unit evaluations. Results of these experiments (n » 66) 
BhcM essentially zero changes or both control and special control 
children, and a strong significant effect for the experimental group 
children. We conclude these results are inteipretable only from our 
expectation raising perspective. (For detailed references see Entwisle 
and Webster, 197Ua). 



Fourth Experimental Series 

The purpose of the fourth experimental series was to study in 
more detail the effects of racial differences in formation and change 
of expectations, in particular, we were concerned with (1) the effects 
of racial match or mismatch between experimenters and subjects; and 
(2) the effects of the basic experiment using integrated groups of 
childen. All experiments to this point had used same- sex and same-race 
children in experimental groups. The first issue has iaplications for 
deciding who can be an effective source of expectations for children, 
and also for deployment of teachers in school systems. Bie second issue 
is relevant to the effects of school integration upon ability conceptions 
of children. 

(1) Results of experiments with children of the same race in each 
group show, generally, that white experimenters are effective expectation 
raisers for white children and black experimenters are effective with 
black children. White experimenters are effective with inner-city black 
children only in the case of third grade children. Black experimenters 
are effective with white children from both rural and suburban schools. 
The total number of children in these racial matching experiments is U72. 

(2) In one of the few integrated elementary schools in Baltimore 
(approximately 6o^t black children and teachers), we performed the expecta- 
tion raising experiment using groups of two white and two black children, 
and experimenters of both races. White experimenters were able to raise 
expectations for both black and white chiWren drawn from mixed- racial 
groupu of children; black experimenters, only with black children drawn 
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from the mixed- racial groups. We interpret the first finding to indicate 
that black children in this school, in contrast to black children dis- 
cussed in the third experimental series, do not perceive white adults as 
hostile and insincere. The possibility exists that the second finding 
indicates that white children at this school do perceive black adults 
as hostile. The number of children in this set of experiments is 116. 

A second finding for these studies of integrated groups is that 
in Phase I, before any intervention, we do not find evidence that child- 
ren are forming expectations for themselves and each other on the basis 
of the race. This finding is striking in view of considerable research 
in segregated schools (for exaiqple, Cohen and Roper, 1972) showing 
effects of low expectations for blacks in mixed racial groups. We 
interpret this result to be one of the few quantitative pieces of 
evidence of positive consequences of school integration. (See detailed 
reference: Entwisle and Webster, 197Ub. ) 



Fifth Experimental Series 

In this set of experiments we addressed ways in which e^ectations 
might generalize. Do high expectations for one task spill over to 
affect other tasks? We noted earlier that often a teacher cannot hon- 
estly give positive evaluations to a student's eeucly problem solving 
attempts a new task. We therefore set out to explore what might be 
termed "contagion effects." 

For this we developed a second task called "meal planning." Phase I 
and Phase III utilize the story-telling task, as before. Phase II uses 
only a meal planning task in which children select from a board pictures 
of foods which would go together well in a holiday meal. In these "gen- 
eralization" experiments, we assume that high expectations for the meal 
planning activity will "generalize" to the story-telling activity. 

Results of experiments with dO white children and 72 black children 
confirm our prediction. In both sets of experiments the race of experi- 
menters and children was matched. (See for detailed reference: Entwisle 
and Webster, l97Uc (appended).) 



The Evaluation Experiments 

The five series using the basic experimental design described in 
the foregoing completed our investigation of several processes involved 
in the expectation- evaluation sequence as we hypothesized it to operate 
between teachers and children in classrooms. A different sort of con- 
sequence is also predicted to depend upon expectations: the nature of 
unit evaluations given performances. The higher the expectations held 
for a given actor, the more likely is anything he says to be positively 
evaluated. This process is represented in Figure 1; it is another 
crucial unobserved link in our analysis of the Pygmalion studies, since 
it models how teachers' expectations get transferred to students. 
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The evaluation experiments (n = 36O) explored the evaluation process 
in three slightly different conditions. In all conditions, an entire 
class of students was given sentences supposedly completed by students 
at another school. Each sentence was described as a performance trtm 
a student either or high ability or of low ability. In one classroom, 
odd numbered sentences were described as coming from high ability 
students; in a second classroom, even numbered sentences were described 
as coming from high ability students. In condition 1, only the "ability" 
levels of sentence authors were given. In condition 2, "ability" was 
described, and this ability was said to be relevant to story-telling 
ability. In condition 3) the ability was described and said to be rel- 
evant, and in addition, the author was said to have told good (or bad) 
stories in the past. Utie three conditions thus are increasingly 
defined social situations. According to the theory, all should produce 
about the same degree of effect. 

To measure expectations the children who are subjects give a 
"grade" to each sentence. Ihe grade given when the author is described 
as having high ability is compared to the grade given when he is 
described as having low ability. The same sentences cure used for 
both. In all three conditions of social specification, most sentences 
were Judged as better when the author was described as having hij^ 
ability than when the supposed author had low ability.. The entire set 
of escperiments was run twice, six classrooms per replication. Effects 
are entirely consistent over the two replications. (See for detailed 
reference: Webster and Entwisle, 197U (appended).) 



Summary 

Hhis completes the experimental research conducted under this 
program. Esgperiments have been devised to study the formation and the 
consequences of expectation states among elementary school children, and 
over 1,000 children have participated to date in these experiments. 
Figure h gives a summary of results and illustrates the major features 
and outccxnes of these experiments schematically. 

In a reconstruction of a research program, there is often an 
appearance of simple linear progression towards understanding and truth. 
What is more accurate, of course, is that investigators progressed from 
some point to the nexJ point and then on from there. Tbrouj^out, our 
strategy has been to apply the perspective of expectation states theory 
to certain problems Important in education, and we have moved from pre- 
liminary results and confirmation to new issues. The theory and its 
laboratory confirmation gave us a starting point and provided us a 
guide for our classroom research. Neither of these could foretell 
final results until the field experiments were done. The research 
program, as shown in the appended reports, contained numerous instances 
of unexpected findings or ideas which emerged as a result of the 
research process. These have been consistently followed up, and have 
often led to considerable increases in our understanding. 
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Figure h 
Suaasary of Experimental Program 



Experiment 
I. Basic 



Research Question 

task development, experi- 
mental design 



Ss 

gr. 3>^: black oity; 

white rural 



2. Secondary 



generality of phenomenon; 
sociological subgroups 
(sex, race, class) 



gr. 1-4: white suburban 



3a. Special Control 
3b. Racial Match 



3c. White, Suburban, 
Female 



"behavior modification" 

"hostility" vs. 
"debilitation" 

contrast effect 



gr. 3} 4: white suburban 
gr. 3>4: black, city 



gr. 3} female: white, 
suburban 



Ua. Racial Mix 



race of evaluator 



kh. Mixed- Race Groups effects of integration 



gr. 1-k: white, rural; 

white, suburban; 
black, city 

gr. 3th: integrated, 
city 



3. Generalization 



raising e^gpec tat ions 
indirectly 



gr. 3}4: white, rural; 
white suburban 



6. Grading 



distribution of peer 
evaluations 



gr. 3|4: white, 

suburban 
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We do not consider however, either that ve are heading totrards 
a specifiable goal, or in fact that there is a definable enc^point. 
The program is continuing, as indicated in Section II of this report. 
In addition to the main work and the minor issues raised during the 
research, we have es^lored and finally abandoned some other avenues. 
Three of these are (1) an es^erimental task, "artistic Judgment 
ability," which did not meet criteria for the experiment; (2) the 
Draw a Man task employed in research by others (see Porter, 1971), 
which did not produce any reliable, intexi)retable results with our 
children; and (3) an experiment in which we hoped to demonstrate 
that children will allocate action opportunities to each other as 
a function of the expectation states they hold, which appeared to 
conflict with cultural norms against admitting differential evaluations 
of individuals. These topics are described more fully in Appendix H. 



Ik 



SECTION I 
4 AFFENDICES 

Published Papers Covering Experlfflents on Children's Expectations 

A. "Raising Children's Performance Expectations" A-ltoA-12 

B. "Research Notes: Status Factors In Expecta- 

tion Raising" B - 1 to B - 12 

C. "Raising Children's Expectations for Their Own 

Performance: A Classroom Application" C - 1 to C - 33 

D. "Expectations in Mixed Racial Groups" D - 1 to D - 23 

E. "Raising Expectations Indirectly" E - 1 to E - 10 

F. "E3gpectation Effects? in Perfomance Evaluations" . . . F - 1 to F - lU 

G. "Effect of a Principal's Esgpectations on Test 

Performance of Elementary-School Children" .... G - 1 to G - 6 

H. "Procedures niat Turned Out to be Unsuccessful" .... H - 1 to H - 7 
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Raising CiiUdrcn\s Performance Expectations' 



DORIS R. ENTWISLE and MURRAY WEBSTER. JR. 
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An experimental procedure to increase children's rate of hand-raising 
is reported. The procedure was tried with 240 grade school children from 
two residential areas and is apparently efTective. This procedure, involving 
raising children's expectations for their own performance, differs in ap- 
proach from previous work raising teachers' expectations. Similarities be- 
tween the procedure and laboratory research in expectation theory are 
noted. The experimental results are uvful in interpreting educational 
research designed to raise teacher's expectations and also useful in suggesting 
further work along similar lines. 



Several investigutofs have reported that grade school children can be 
affected by the expectations held for them by their teachers (t.g., Rosenthal, 
1968; Meichenbauni, Bowers, and Ross, 1969; Bropliy and Good, 1970). One 
person's expecl;ilions of another's behavior may act like a self-fulfilling 
prophecy. Previous work in this area has not been systematic, however, in the 
sense of explicitly specifying the determinants, the definition, or the con- 
sequences of these performance expectations. For example, some of the 
studies cited point to the importance of expectations in educational settings 
without attempting to specify precise mechanisms at the individual level that 
alter expectations. In many of these studies teachers are given false informa- 
tion about stuHcnts* test scores and then students subsequently perform in 
accordance with the false scores. We presume the teacher raises his expecta- 
tions for the student when high (false) scores are given. Then this revised 
opinion causes the teacher to act so as to allow the student to patticipate 

•This work was supported by Nlil Grant MIM8183-01. OE Grant OEG-3-7I-OI22, 
and by the Center tor Social Organization of Schools.The Johns Hopkins University. We 
thank Margaret Boeck:nann for her help devising ihc experimental procedure, and Ellen 
Dickstein und Ujrbara Bricks for their help with the field work. Special thanks are due to 
Mr. Jjck tpsicin jnd Dr. George Gabriel for arranging for this research in schools. 
Principals who cooperated in the research helped in many ways. They are: Mr. Lyman 
Huff, Mr. Willijm II. Magzis, Mr. Daniel Rickowiak, and Mr, Samuel Sharrow. Mr. Elliott 
Epstein and teachers and students of School S3, Ualtimorc City, were of great help to us 
in pilot studies preceding the experiments. 
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moret and more often to give positive evaluations of performances of that 
student. Once this chain of events gets started, there are powerful forces to 
cause it to continue. The student, perceiving the teacher^s high expectations 
for him, tries to justify those expectations. More importantly, the student 
probably raises his expectations for his own future performance. With his 
expectations for his own performance raised, the student is more likely to 
participate in class, to receive favorable responses from others, to have a high 
opinion of his own ability, and the like. A central feature of this sort of 
explanation is the circularity of the process involved: raising the teacher*s 
expectations for a given student causes the teacher to treat that student 
differently to be more likely to call upon him in class and to evaluate 
positively anything he says. Hiis in turn causes the student to raise his own 
expectations for himself, and thus to behave differently -to attempt to answer 
questions more frequently, to be more confident of his. answers, and to expect 
(and actually to receive) more agreement from other students. If a teacher is 
given information which leads to the formation of high expectations for a 
particular student, it is likely that the teacher will treat the student in a way 
which will raise the student's expectations for his own performance. If this 
happens, the student will then behave in a way which will be positively 
evaluated by the teacher, and consequently in a way which will cause further 
raising of the teacher's expectations for the student. The expectations which 
are held for students, therefore, are extremely important determinant, of 
future interaction and they are partially independent of **objective*' evalua- 
tions of performance. Furthermore, expectations of a given level tend to be 
self-maintaining, for once they exist, they affect the very interaction com- 
ponents which determine expectation levels. 

The main contribution of the present paper is reporting of an experi- 
mental procedure to raise directly children's expectations for their own 
performance and to do this without resorting to deception. Previous classroom 
research, as noted above, has attended to teachers' expectations, but children's 
expectations may be of equal, if not greater, educational signiflcance. If 
expectations of children are to be altered in ways that are of use in the actual 
process of education, the alteration cannot rest on the giving of false 
information (as in the Rosenthal studies) or on the lowering of some 
children*s expnofattons so that others* can be raised (as when children are 
randomly assigned to high or low tracks within a school). The procedure 
reported here seems to avoid these drawbacks. 

The experimental task developed was a modification of a storytelling 
task previously used in research on cognitive development of children 
(Entwisle, Grafstein, Kervin, and Rivkin, 1970). In addition to meeting the 
major criteria listed above, it also meets practical criteria: (1) it is similar to 
activities that ordinarily occur in classrooms; (2) it provides discrete, easily 
observable |M!rfurmance outputs that permit clear evaluations; (3) it is 
interesting ciunigh to capture children's attention. 
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RAISING I'XPUCTATIONS 



METHOD 

Subfccts. Children were drawn from four schools in the Baltimore area, 
two inner-city schools with nearly 100% black students, and two rural schools 
with nearly 100% while students. Hie rural schools are located in a farming 
district 30 miles north of Dallimore, near the Pennsylvania line. For the two 
inner-city schools, only students whose school records showed tested IQ scores 
between 90 and 110 were selected for the study. Hie IQ range for rural 
students is considerably lafger, from 76 to 141, but the mean IQ is 105 for 
third*graders and 108 for fourth*graders. Experimenters were middle class 
white persons in the 20-30 age range, both sexes. 

Procedure. At the beginning of an experimental session, children were 
brought together and told that the researchers were looking for people who 
could tell good stories. They were to be divided into *'teams" and were told 
that the team which made the best stories would win a prize. Then one 
experimenter took the members of each team to a separate room and 
described the storytelling task to them. Members of a learn were chosen so 
that children on one team c^ne from different classrooms. 

Tlie experiment has three phases. In Phase I, the **baseline'* level of 
volunteering (hand-raising) is determined for all children; in Phase 11, the 
attempt was made to manipulate upward the self-expectations of the child 
selected as the 'Vxperimcntar* child; and in Pliase III, the rate of volunteering 
(liand-raisiitg) was again measured. The Phases are summarized in Table 1.2 
Children were told that the **game" consisted of making up a story. The 
experimenter would help by starting sentences, but then the children should 
try to make interesting stories by supplying ''good** words when asked for 
words for the story. In each instance the same story skeleton ($ee Chart 1) 
was used, and the •skeleton w; > missing 12 words. Children supplied these 12 
words. 

Children were told to listen carefully while the sentence was being read, 
then when the blank was reached, to try to think of a good word. Anyone 
who thought of a good word was to raise his hand, and the experimenter 
would select one child to give the ''leanrs word** for that sentence. Children 
were cautioned not to raise their hands unless they Ihouglit they had a good 
word, for if they were cdX^^A on and gave a ''bad** word, this would hurt the 
team scores. The experimenter allowed 30 sec to elapse after reading the 
sentence before calling upon a child. 

2a few children in Ihc inner city schools told two siorics in each phase. Only the 
flrsl story in each phase was evalualcd to make the procedure as consistent as possible 
with the proivdnrc t'ollowed for most of the inner city children and fnr all of the rural 
children. Also in a few insLnices for inner city children there were Tive rather than three 
itmlrol i:rou|> Mthjects in an e\|H.*rinient. In such cases Iwo subjects |H;r |*roup have been 
discarded using a laiidoni procedure. 
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TABLE 1 
Summary or Experimental Procedure 



ControLVs 



Experimental 5s 



Phase I 



Phase II 



Phase III 



One story is produced Control 5s have story 
(12 words); iio evalua* read to them by an* 
tions: level of volun* 
teering observed 



other adult; from 12 
to IS control 5s join 



in one group 



Repeat Phase I, with 
same control 5s and ex- 
perimental 5s as in Phase 
I; experimenters are ro- 
tated so the experiment- 
er is unaware of identity 
of experimental 5s 

One story is produced Experimental 5s make Repeat Phase I, with 

(1 2 words); no evalua- up story individually same control 5s and ex* 

tions; level of volun* with the same experi- 
teering observed 



perimental 5s as in Phase 
menter they have seen I; experimenters are ro- 
in Phase I; receive all tated so the experiment- 



positive evaluations; 
experimental 5s Join 
control 5s at end 



er is unawariB of identity 
of experimental 5s 



CHART I 

Story Skeleton with Samsilc Entries from a Rural Group 

There was once a (very tall prince) _ 

Who had a i^'^K) 

That (lie, she) (lived in)_ 

One day (he, %he) had i4> go to (the du ngeo n to se c hb prisoners) 

(lie, she) did this very (angrily) 

Because (he, she) wanted to (make sure they were there) 



This was very dangerous because of the (strong prisoners) 

Who (which) was (were) very (mean) 

In order to fool the (fill in) the (fill in) dressed up 

Asa (another prisoner) 

In such a disguise the (fill in) looked (mean) 



And when the (fiUin) saw the (fill in), they (welcomed him) 

This made the (fill in) (feel pretty good) 

And /complete story/ (he let his new friends go.) 
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RAISING CXPGCTATIONS 

Before calling on a child, the experimenter recorded privately which 
children were holding their hands up (the measure used for the child's 
expectation state). The experimenter held a clipboard so that children could 
not see what was on it. The clipboard was used for recording words given by 
the children. While the 30-scc waiting period was occurring, it was easy for 
him to make small marks indicating which children had volunteered without 
the students* being aware of his recording. 

Phase I consists of an initial story being produced as just described. The 
experimenter did not evaluate any of the words given durirg Phase I. He 
aiM upon cacii child for Phase I approximately the same number of times. 
With 12 words to supply and four children playing, this permits each child to 
be called upon three times. In only a few instances did any child volunteer 
less than three times altogether and so unbalance the selection of respondents. 

At the end of Phase 1 children were told that they should now return to 
th jom where they had initially assembled. After they began to move out, 
the experimenter said quietly to one child (chosen because his level of 
hand-raising was near the median for the group) that he should stay in the 
experinjenlal room and wait ihsie foi a minute for the experimenter. After 
the experimenter made sure that the control children were on their way to 
the proper destination, he returned to the room and to the selected 
(experimental group) child. The experimenter then told the experimental child 
that he would have an opportunity to make up a story all by himself; also 
that the experimenter had played with many children making up stories and 
that the experimenter thought he (the cxperimemal child) was really very 
good at the task. Tlien a story-skeleton with a new lead word was niled in 
orally by the child, as before, but of course there was no volunteering. The 
child merely supplied a word when the sentence was read up to the blank. 
After each word was supplied, the experimemer indicated approval vigorously 
-by smiling, by nodding, by commenting a "very good word"-**good"- 
"thal's interesting!", etc.; i e.. he indicated approval in every way possible 
consistent with sincerity. 

When Phase || storytelling was completed the experimental child went 
and joined the control children in Phase II. Phase II for control group children 
was a storyreading session. Cliildren from seveial game groups gathered in a 
central room as they fmished Phase I. The storyreading went on continuously 
and experimental group children also joined the storyreading group as they 
finished liieir Phase || activities. The experimental children thus listened to 
the end of the story being read to the control groups. The experimental 
children s entrance was not noticeable because they were part of a large group 
where other members had also arrived at different times. The storyreading 
prevented commupication among the children in Phase II. At the end of the 
storyreading. the children were told to "go back to the room where you were 
before. ' AN children thus returned together. At this point experimenters were 
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rotated among rooms so each experimenter had a new group and was unaware 
or the identity or the experimental child. The length o( each phase varied 
with the Phase II control procedure adjusting its length to the needs of each 
set or experiments. 

Phase III, except Tor the rotation o( experimenters, was a repetition of 
Phase I. The four children made up a story; the experimenter remained 
neutral and recorded unobtrusively how many tinteseach ^ild raised his hand. 

RESULTS 

The results of the experiment are given in Tables 2 and 3.3 The data 
consist of gain scores for each child, the increase in the number of times each 
child raises his hand in Phase III compared to the number of times he has 
raised it in Phase I. The differences between experimental and control group 
means are as predicted, and increases in average volunteering rate vary from 
0.97 to 2.03. The analysis of variance in Table 3 shows that this difference in 
response to treatment is highly signiHcant. None of the interactions is 
significant; the response to treatment does not vary significantly by grade or 
residential locus. 

DISCUSSION 

Results suggest that the experimental procedure devised does lead to 
changes in rales of volunteering, which, we believe, represent changes in 

TABLE 2 

Average (Sain in Rale of Volunlccring from Phase I to Phase III For 
Experimenlal and Conirol Groups OVs are in Parentheses) 



Inner cily Ruial 





Grade 3 


Grade 4 


Grade 3 


Grade 4 


Experimenul group 


I.8S 


1.90 


2.60 


1.90 




(20) 


(10) 


(20) 


(10) 


Conirol group 


0.88 


0.20 


0.57 


-0.07 




(60) 


(30) 


(60) 


(30) 


Diffcfcncc bclvkvcn groups 


0.97 


1.70 


2.03 


1.97 



^Altogether 67 obitcrvations were discarded lo yield proporlionji subclass numbers. 
Each subclass is approximately balanced for sex. An earlier least-squares analysis of the 
unbalanced data is consistent with findings report ed here. Strict additivity holds for the 
case of pru|H)rtiunjl subclass numbers. ;ind as in the presi*nt analysis, all effects are 
orthogonal. Ilert (irean kindly supplied a proof fur the orthogonality of proportional 
subclass numbers fut three variables. 
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TABLIv 3 

Variance Analysis: Cains In Rate of Volunteering .«ural and 
Inner (?ity (liildren, Grades 3 and 4 



Source of variance 



Experimental vs. control treatment (T) 

Residential locus (R) 

Grade (G) 

TX R 

TX G 

RxG 

TX Rx G 

Within T-R-G groups 





Sum of 


Mean 






Squares 


square 


value 


1 


116.81 


116.81 


18.19* 


1 


0.60 


0.60 


0.09 


1 


17.63 


17.63 


2.75 


1 


7.20 


7.20 


1.12 


1 


1.12 


1.12 


0.17 


1 


0.30 


0.30 


0.05 


1 


1.S8 


1.S8 


0.25 


230« 


1476.72 


6.42 





•Two misMng observations were estimated using appropriate subclass means. 
^Beyond tiic 0.01 level. 



sclf-expcciatlon states. In no previous work has there been an attempt to alter 
children's expectations directly, and in all previous work there has been major 
reliance upon the use of false information. While techniques using false 
information may be very effective and consequently useful for research 
purposes (cspcciiilly in laboratory studies where the deception involved may 
be explained iinincdialely afterward), false information is clearly not desirable 
for long.lcrm use in applied research. In studies where expectations are raised, 
there are pructicul diinculiics associated with the continued use of false 
information; in studies where expectations are relatively lowered, as in some 
tracking studies, there are mural difficulties. The maneuver we have described 
involves an adult interacting with a child on an individual basis. The adult 
gives consistent positive evaluations of performance in a task where the child's 
actual ability is almost irrelevant. The evaluations are therefore not incon- 
sistent with anything known about the child or with his potential. To the 
extent that the fjianeuver is successful in changing the child's behavior, it 
should improve his performance at a wide-variety of tasks. 

The nature of the experimental maneuver. To clarify the nature of the 
experiments reported here it is important to define exactly the experimental 
procedure so as to rule out rival explanations for the results. 

Maneuvers that give fictitious results to students or to teachers may be 
defended when they are one-shot procedures to demonstrate self-fulfilling 
prophecies, placebo effects, and the like, or when they are needed to allow 
quick and sizable manipulation of variables in the laboratory. Other methods 
must be sought, however, when the goal is to change children's long-range 
expectations for ihemsclves. or others' expectations for children in classroom 
settings. If cxpcciation alteration is seen as one goal of education, ways must 
be found to altci expectations that can be defended over the long-term and 
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that are compatible with other educational goals. The procedure described 
here has achieved some modest success along these lines. One issue, that or 
biasing of results, is particularly salient, for it is well-known that even a slight 
change in manner of a teacher can alter signiflcantly the likelihood that a 
child wUI raise his hand. A number of steps regularly were therefore taken to 
minimize the effects of any systematic influence from the experimenter. First, 
in Phase I of these experiments, Es were trained to treat the children as 
equally as possible and to refrain from evaluating any of their performances. 
The child selected as the "experimental** child at the end of Phase I was 
chosen on the basis of his having responded close to the median for his group. 
To choose a child with too low a response rate in Phase I might have biased 
results in favor of predictions through a **regression to the mean** phenom- 
enon in Phase III. To choose a child with too high a response rate in Phase I 
would have biased results against predictions, because of a ""ceiling effect** in 
the measure of expectation states used. To avoid drawing attention to the 
child selected, the E did not say anything to that child until the others were 
on their way to the other room for Phase II; nor did the E explain at all to 
the others why this child was asked to remain behind. All children are with 
an experimenter during Phase II, and the experimental group children 
participate in at least part of the control treatment of this phase. 

The major danger of biasing would come in Pliase III, however, if the E 
responded more warmly or more positively to the ^'experimental** child. In 
addition, this child might feel that, because of the individual session with the 
E in Phase li, he had some "special** relationships with him.^ In order to 
avoid problems of this nature, Es were rotated before the beginning of Phase 
III. Tlius the was new to the group in Phase III, unfamiliar to both the 
experimental and control children. Furthermore, the E did not know at this 
point which child had been given the Phase II "treatment,** and thus was 
unlikely to treat the children differentially. In Phase III. as in Phase I, Es are 
trained to call on every child as nearly as possible the same number of times. 

Probably the major alternative interpretation of these results is one 
which would describe the Phase II procedure as reinforcement of behavior 
rather than positive ev aluation. In other words, a major alternative explanation 
for these results would not use the idea of expectations at all, but would 
argue instead that the experiment demonstrates that childen will increase the 
rate of ejnission of behaviors which have been rewarded in the past. 
Interpreting the results of this study in terms of behavior reinforcement would 
mean arguing that there is no reason to believe that the experimental group 

later series of experiments (forthcoming) with additional control groups rules 
out explanations based upon isolation with the experimenter. This result has also been 
found in highly controlled laboratory studies by Professor Barbara Sobieszek at the 
University uf R<Khester. We thank Professor Sobieszek for making her daia available to 
us prior lu puhlii;ttion. 
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children changed their conceptions or their own abilities, or that they would 
be more likely to display the behavioral consequences of a raised self-concept» 
such as increased confidence and willingness to try to perform in other 
situations. In assessing such an explanation, it is important to note that our 
measure of expectbtion states in Phases I and II was rate of volunteering, or 
hand-raising. What were positively evaluated in Phase II were words spoken by 
the children. At no time during the experiment was anyone positively 
evaluated (or reinforced) for hand-raising. Furthermore, the speciflc words 
evaluated in Pliase II were seldom the words volunteered in Phase III. Thus, 
an explanation based on the child's expectations is more parsimonious than 
one based on a reinforcement paradigm. 

Additional support for the interpretation given our experiments is found 
in experiments which measure children's expectations by using a kind of 
self-rating scale. They consistently report that expectations are increased after 
social reinforcement by an adult, and are unchanged when an adult maintains 
a neutral role (Hill and Dusek, 1969; Crandall, 1963; Crandall, Good, and 
Crandall, 1964). The actual procedure used by Hill and Dusek was very similar 
to our own, for an adult responded ^That's good. Fine. Very good. You're 
doing well" for positive reinforcement following attempts at an angle-matching 
task. Tlie adult was neutral and nonresponsive in their nonreinforcement 
condition (like our Phases I and III procedures). Other studies (Maehr, 
Mensing, and Naf/ger, 1962; Videbeck, I960) showing that if an individual 
gets approving reactions from others with respect to some specifled attribute 
he will improve his self-rating on that attribute, are also consistent with the 
conclusion that children's expectations liave been improved by our experi- 
mental maneuver. 

With respect to alternative interpretations of the results of the study, 
obviously we cannot rule out all rival explanations with perfect certainty. 
However it seems difficult to construct an explanation which accounts for all 
the results of this study, as well as for the results of others' work mentioned 
earlier, as satisfactorily as does an '"expectation" interpretation. For example, 
it might be argued that the effect of the treatment of the experimental group 
child in Phase II was to give him additional practice with the task of 
storytelling, and that it was the effect of the practice which produced the 
increased confidence in his ability to tell stories. This would mean that the 
positive evaluations of performance in Phase II were unimportant, and 
possibly that the level of expectations as determinants of behavior in Phase III 
were irrelevant as well. But it is important to note that the task is one which 
calls for no special ability, and certainly not one which can be ''improved'* 
through practice In the absence of performance evaluations from £*, it is 
probably difficult for children to decide whether even their own words are 
''good ones'*; the subjective impressions of the experimenters in the "unevalu* 
ated" Phase I corroborate this. 
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Relation to previous research. Some research Co which the present study 
is closely related is laboratory work in expectation theory by Berger and his asso* 
dates (l%l, 1966, 1968, 1969). Expectation theory assumes that in task situa- 
tions positive ex|ieccations are communicated by a source (an experimenter or an 
evaluator)who|iives positive or negative evaluations of performances. Another a^ 
sumption is Chat Che person receiving che evaluacions comes to hold an expecta- 
tion scate chat is in accord with those evaluations. One can then derhfe the propo- 
sition chat if one person receives a large number of positive evaluations (experi- 
mental group children) while a second person has not received any evaluations 
(control group children) then Che first individual is more likely to accept an 
action opportunity (raise his hand) Chan the second person. The increased 
acceptance of action opportuniciei would be seen as an improved state of 
self*expectation or of 5elf*evaluacion. The present experiment operationalizes 
variables in a way similar Co the way such variables are operationalized in the 
laboratory. The present experiments in naturalistic settings are entirely 
consistent with this laboratory work. (The current work of Cohen and 
associates (1970) involves applying the same tl<eoretical perspective to a 
different set of substantive problems.) 

Other recent work analyzing behavioral concomitants of expectancy is 
also consistenc with both Che laboratory work and our work. For example, 
Meicheiibaiiiu ct al. observe chat expectancy instructions (identification of 
*1ate bloomers**) cause some teachers to increase positive interactions with 
studenCs, or Co decrease negative inccraccions. Posicive interactions included 
convcyini* encoiiragcmeiU, praise, or any attitude of satisfaction. Also Brophy 
and Good (1970) observe ChaC flrsC grade ceachers Cend to demand more of 
those children lor whom they hold high expectations, and to praise these 
children more when tlicy do respond. Students seen as low are praised less 
and held to lower standards. In other instances when expectancy or teacher 
expectancy experiments have failed to achieve results that were anticipated 
(Goldsiniili, 1970; Fleming and Anttonen, 1970) an analysis of findings in the 
light of the laboratory work of expectation theory may be helpful. For example, 
it may be tliat the teacher does not alter expectancy because students do not 
accept the validity of the teacher*s expectations. 

Educational implications. The major result of this work is the experi* 
mental task itself. It offers a means of manipulating (raising) young children's 
expectations that is not subject to the drawbacks affecting ways of manipulat- 
ing expectations used previously, and even more important, the raising of the 
expectations of the child himself i: probably a more direct and more general 
way of improving educational performance than the changing of other 
persons' expectations for the child. Tlie behavior influenced - frequency of 
hand raising is generally considered to be important educationally. Hand- 
raising leads lu gieater participation. The importance of active participation to 
learning is too well kiu>wn to require documentation. In future reports we 
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hope to present the resuhs o( studies of the degree of effect of this procedure 
upon racial, age, and SES subgroups of the population. Further work will 
extend the basic experimental design to examine the relation between 
expectalron change and structural variables like sociometric standing. 

The present. experiment includes children from grades 3 and 4, from two 
very different residential loci. Since there are no signiflcant differences by 
locus or by grade but a highly significant gain in rate of volunteering when 
experimental children are compared to control group children, it seems likely 
that the treatment devised is an effective one for many children of this age 
level. 

Assuming future work agrees with results reported here, one must 
evaluate the potential of this procedure for educational purposes. There is 
value first of all, in making explicit parallels between social psychological 
research and research in classrooms. As has been so frequently noted, there is 
a surprising gap between the two fields of research thai hinders both. By 
linking classroom research to mure tightly controlled research, one gains 
coherence and explanatory leverage on a whole body of research fmdings. 

Tlie viability of the experimental maneuver reported here for securing 
long*term educational effects may not be large, however. The present pro* 
cedure, as pointed out earlier, does have the distinct advantage of avoiding the 
use of false test scores or of any form of outriglu deception. On the other 
hand, it is difficult to imagine using such a maneuver over and over to obtain 
general effects in raising children's expectations. Yet the procedure may have 
usefulness over the short term in two ways: (I) Tlie procedure may act as a 
pump primer. Tliat is, if a child's expectations for himself are suddenly raised 
as in the experiment, he may alter his actions in ways consistent with his 
increased expectations for himself. Any intervention within the circular series 
of events-improved self-expectations leading to better performance leading to 
improved teacher evaluations* may be effective. The short-term change in a 
child's actions induced by an expectation*raising procedure might thus fire a 
chain reaction that would tend to continue. (2) This procedure and others 
that could be invented may give precise suggestions as to how teachers can 
convey positive expectations. Some teachers may wish to convey positive 
expectations but not be very adept at it. The suggestion here is that 
encouragement for one activity (word-giving) leads to an increased level of 
another activity (hand-raisii.g) at a later date. In classrooms where many 
activities occur, then, the teacher need not wait, perhaps in vain, for a 
praiseworthy performance in arithmetic to improve a child's expectations for 
himself in the area of arithmetic. If the child can be reinforced for 
performance of another kind, this may generalize to yield increased participa- 
tion across the board. In fact, some situations may be used primarily to 
improve expectations rather than for learning per se 
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Research Notes: 

Status Faaors in Expectation Raising* 



Doris R. Entwisle 
Murray Webster, Jr. 

The Johns Hopkim University 



Previous experiments with grade school children, designed to raise their 
expectations for their otan performance, were repeated with pupil subgroups 
formed according to race, sex, and age. Positive results of the earlier studies 
were reproduced, and further analyses are presented to auess the effect of 
some status factors in the situation. The experimental procedure was effective 
across the subgroups studied. The relation of this luork to ''teacher expectancy" 
studies is discussed, and some implications of the results, in terms of other 
studies on expntations and tlie structure of competition in schools, are drawn. 

p£RFOKMANCK KXPECiATiONS, LIKE othcr bclicfs individuals hold 
about themselves* can act to produce a self-fulfilling prophecy. At 
the most general level, people who expect success in a given task 
are often more likely to meet success than arc people who expect 
to fail. Behaviors such as sustained attempts to solve a problem, 
expressions of self-conndencc, frequent verbal interaction, and 
refusal to accept innuence from others arc thought to be deter* 
mined, in pait» by an individuaKs expectation level and, in turn, 
to be partial determinants of his actual success. 

Expectations for success are of continuing interest in educa- 
tion. For a long time it has been felt that the child's expectations 
for his own success and the expectations which are held by 
significant adults (such as teachers or parents) for the child's 
success are important determinants of liis actual performance. 



*Much of Uiis rcscurch was siipputtnl by Grant No. MH I8I8S. Prcparaiioii 
wai fadliiaicil Uy nraiil OKC.-3-7I-OI22. Special thanks aic due lo Dr. Evart Cornell, 
principal, and the teachers and stndcnls at Sronclcigli Elementary School for their 
help in these rxpriimcnts. 
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Expcdaiioiis. either :i% ilic major inde|)eiident variable or as an 
imcrvciinig variable, have been invoked as the explanation for a 

arge iiuiiibcr of resiihs involving children's academic performance, 
(^ee Meichcnbauni, Howers. and Ross. 1960. for a recent review 
of much of the related literature.) 

Some investigators have demonstrated positive effects by 
artiluially raising cx|)eciations held for specific children (for 
examjjlc Koscnihal and Jacobsoii. 19G9), but such research has 
produced contradictory and unexpected findings as well (for 
example. Cllaiborn. lOCO). The inconsistencies in exijectation re- 
scarth ;nc ptiplcxiiig. This research has not been guided by ex- 
plirit theory spnifyiiig how cxpcciations may be raised or the 
condiiioiis under which various ircalinenis will be successful or 
unsiia essfiil. Typit ally iliere has liccii neither any way of knowing 
exactly why a given procedure should work nor of interpreting 
iicgalive results when a prtucdiiie did nut work as predicted. The 
absence of explicit theory may he an important reason for the 
mixed results. 

The child in the classiooni is in a situation similar to that 
studied III highly tdiiirollcd laboratory expcrimenis on expecfa- 
ttou Ihcoi^ by Uciger and his associates (Bergcr and Snell. 1961- 
Bcrgcr and Coiiiuir. I9fi9: Hergcr. ct a!.. 1909). The child is acting 
as a member of a prtiblcm solving group, he usually is motivated 
to do well, and he perceives that the task of the group as a whole 
IS to solve a series of jiroblems. Hecause of these similarities, we 
felt that research attempting to raise children's expectations could 
be fashioned along the same lines as laboratory experiments in 
expectation theory. (See Entwisle and Webster. 1972. or Webster. 
1969. for a siatement of die propositions of the dieory) 

We dcsigncti a study based upon the propositions of expecta- 
tion thetiry. ticvising procedures that proved to be successhil in 
raising thihlrcn's eX|»eciaiions for tlieir own jwrformance. In an 
earlier rep«iii (l-ntwisle and Webster. 1972) we discussed task de- 
vclopinciii. Mime initial experiiiiciits. and the relation between 
ediicaiional (ield settings and lalMiralory si^vtings. This first work 
was essentially exploiaioiy and psyt hdUigiVal in nature: we were 
ctiiuerned with tievehipiiig effet live means of applying the theory 
to the learning sitiiaiiiiii and with determining whether we could 
alfet t the chiUl s cognitive expet tations for his own performance. 
Here ue repent lesiilis from further work designed to examine the 
effect of sciciohigieal variables: whether, for example, the results 
of the earlier study could lie replicated using a different sample 
and across vari.iti«ins <if age. race, and SF.S subgroups of the school 
population. 

To review bricdy the earlier study, wc devised a three-phase 
exjierimental ti eatnieiu m increase the self-expectations of children 
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and we ineasuicd the child's cx|)Cct:uion level by the rate of accept- 
ance of action op|)ortunities, that is, hand-raising in response to 
questions. Children were formed into groups of four; each group 
met with a single experimenter. They were told that their task 
was to help mak^ up stories and their group was in competition 
with other groups to make up the **best'' or most interesting 
stories. The children were asked to supply words to fill in blanl^ 
in a "story skeleton" consisting of 12 sentences with one word 
missing from each sentence. 

The invention of each of three stories constituted a **phase'* 
of the experiment. In phase I, the experimenter was neutral: he 
recorded the proportion of times each child raised his hand and 
called ccptally on children who volunteered to supply the 12 
missing words. In phase II, one child constructed a story by him- 
self (experimental group), and the other children (control group) 
went to another room where they played a game or had a story 
read to them. Kvei7 word the experimental group child gave in 
phase 11 was evaluated positively by the experimenter; according 
to tlie theory, this proc edure should have raised the child's expecta- 
tions for himself and, consequently, his rate of hand-raising. In 
phase III, the origin;3l groups were reconstituted, ex|)erimenters 
were reassigned to groups to prevent unequal treatment of the 
experimentil group child, and the measurement procedure of 
phase I was repealed. 

In the initial studies, the experimental group child in phase 
II generally increased significantly the rate at which he volun- 
teered words between phase I and phase III; the increase was 
signiHcantly greater for experimental children than for "control" 
children. Thus, results of the initial studies were encouraging — 
the procedure and the theory seemed to work as predicted. The 
research iitcludrd two groups of third and fourth grade students. 
One group w:is black and lives in the inner city of Baltimore; 
another gioup was white and lived in nnal Maryland. These 
groups were diverse in many res|K*( ts sueh as life style and family 
enviionnu'ui, but in other ways they resembled one another — 
(or example, in economic level and educational problems. 

This paper reports an extension of the earlier research in 
two iin|)ortant directions: first, by studying white, middle-class, 
suburban children, the social class or sul)cuttural group dimen- 
sion uas exicilfked; second, by studying children from first and 
second grades, as well as grades and 4, the age and amount-of- 
schooling dimension was extended. These extensions when com- 
bined with liie earlier work now provide data for third and fourth 
grade children from three widely different cultural backgrounds 
(black ghetto, uliii** suburban, white rural) and give some notion 
of how youni^er children (grades 1 and 2) react. 
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The studies reported here are based on a three-phase expert- 
mental design, where the second phase included a ]}rocedure to 
raise expectations of a randomly selected child. (The reader is 
referred to Entwisle and Webster, 1972, for complete de- 
scription of the design.) Subjects included approximately equal 
numbers of boys and girls, with 79 first graders, 84 second graders, 
112 third graders, and 103 fourth graders. All members of the first 
four grades in a single suburban school participated. Some groups 
were later eliminated randomly to produte proportional numbers 
of children in the various subclasses. Each story-writing team, as 
before, consisted of members of a single sex and grade. Insofar as 
possible, the four members of each team were chosen from dif- 
ferent classrooms. For some grades that were distributed among 
three classrooms, two children were taken from a single classroom 
and the remaining two from t^vo other classrooms. 

Results 

Table I shows the mean gain in rate of volunteering from 
phase I to phase III fur both the exi>erimental and the control 
groups. These groups are t)alan(*c(I approximately for sex. Three 
sorts of changes in rate of volunteering are reflected in Table I. 
First, all groups, control as well as experimental, showed some 
increase in rate of volunteering. An increase for the control 
groups could have been due to such things <is increasing familiarity 
with the situation and excitement over Mie competition. Second, 
for all grades, the increase was signiflcuntly greater for children 
in the experimental groups than for the control groups (see Table 
2). The iiu tease in the experimental groups is consistent with the 
predictions of the theory, and it constitutes a replication of the 
earlier finding with a different subject population. Third, second 
gr<ide children showed the greatest absolute amount of increase 
for the experimental group and also the greatest difference be- 

TABLF. 1 



Average Gain in Rate of Volunteering From Phase I to Phaie III. 
White ' iditic Class Students 





1 


Grade 

2 


S 


4 


Ail Grade* 


Expcrinu'iital (froii|> 


1.20 


2.55 


1.85 


1.85 


1.86 




(20) 


(20) 


(20) 


(20) 


(80) 


Control Cruup 


0.67« 


0.67 


1.20 


1.38* 


0.98 




(59) 


(60) 


(60) 


(59) 


(238) 


UifTcrciicc Ix'twrcii (* roups 


0.53 


1.88 


0.65 


0.37 


0.88 



* This mean is based on 59 rather than GO observations. 
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TABLE 2 

Analysis of Variance for Cain in Rate of Volunteering, White Middle-Clasi Students 

Approximately Balanced for Sex 







Sum of 


Mean 




Source 


<i.f. 


Square* 


Square 


F-value 


Expcrimciiial v*. Coiilrul Trcaimciit 


1 


46.81 


46.81 


6.02* 


Grade 


S 


82.42 


7.47 


0.96 


Trealnicnl x Grade 


8 


20.27 


6.76 


0.87 


Wilhin Trenimeiil x Cradc Croups 


SIO» 


2409.70 


7.77 





* Beyond iher>% le%el. 

^Two observalioiis are missing, estimated by subclass means. 



tween experimental and control groups in this study, but the grade 
X treatment intenution is not significant. The results of this study 
are consistent with the results of the previous study. Thus, the 
effects of increased ex|)C('lalions have been demonstrated for a con- 
siderably wider sample than previously. 

The restilts presented in Tnhles I and 2 suggest that the 
experimental procedure has l>een successftil in raising childreti's 
expectations m predicted. Children in the experimental groups 
increased their rate of voltinteering between phase I and phase III 
significantly more than did children in the control groups. Putting 
these data together with results reported for the previous samples, 
we concltide that the expectation-raising procedure is effective for 
a variety of stihgrottps in the poptilation: white rural children, 
black inner-cily children, and white suburban children. 

In Tables $ and 1 ftirther analyses are presented that combine 
data of third and fourth grade children in the present study With 
data proi'tired in the previous study. (The data for the earlier 
samples dilTL-r slightly from those reported originally — Entwisle 
and Webster. 1072 — in that data have been randomly discarded 
from some grotips to provide proportional subclass numbers. The 
exclusions do not introdtice changes in any results reported 
earlier.) Grade and treatment again are factors, and residential 
loctis may now be added as a third factor. Half of the data from 
fotirth grade ( hildren reported in Tables 1 and 2 have been elim- 
inated randomly so that data for white suburban children when 
combined with earlier data will yield a matrix with proportional 
sub class numbers. 

Table .*l shows that the treatment was effective for all sub- 
grotips of third and fourth grade children in Imth sttidies. With 
the exception of rural fourtli grade children, ail control groups 
showed an increase, but for no subgroup was the increase for 
control gvonp children as great as the increase for experimental 
group children. 
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TADLK 3 



Average Gain in Rate of Volunteering From Pliasc I to Piiasc III, 
for Children fron) Vluxc Residential Loci, Grades S and 4 





Suburban 


Inner-City 


Rural 




Grade 
3 


Grade 
4 


Grade 
S 


Grade 
4 


Grade Grade 
3 4 


Experimental Group 
Conlrol Cruup 


1.65 
(20) 
1.20 
(60) 


2.20 
(10) 
2.07 
(30) 


1.85 
(20) 
0.97 
(60) 


1.90 
(10) 
0.20 
(30) 


2.60 1.90 
(20) (10) 
0.57 ~0.07 
(60) (SO) 


Difference Between Croups 


0.65 


0.13 


0.88 


1.70 


2.03 1.97 



Tabic 4 shows ihai the overall effect of ireaimem was highly 
signifutaiu (|)<.01). There is no firm cvidente here of differences 
by grade or by residential lotus (residential locns approaches 
signilicaiico, p<.l(>), nor is any interaction of treatment with 
other factors significant. As in the earlier report, it seems fair to 
conclude tliat expectations were raised significantly in experi- 
mental group children compared to control group children. In 
addition, from these later studies we conclude that residential 
locus or grades does not signihcantly affect susceptibility to treat- 
ment because the treatment factor does not interact with these 
other factors. 

We alstj ran a short series of more elaborate experiments 
where an additional control group was formed. In Phase II of 
this series, one of the control children met with a second experi- 
menter, produced a story, and in every way received identical 
treatment to the experimental child's Phase II treatment except 



TABLE 4 

Analysis of X'ariance for Gain in Rate of Volunteering, 
Three Residential Loci. 1'wo (Wades 



Source 

I'reatmcnt (T) 

Residential Locus (K) 

Grade (G) 

TxR 

TxG 

RxG 

TxRxG 

Within I , R, C S-ibgioups 





Sum of 


Mean 




d.f. 


Squares 


Square 


F'Value 


1 


110.21 


110.21 


14.75* 


2 


43.62 


21.81 


2.92 


1 


0.87 


0.87 


0.12 


2 


22.21 


11.10 


1.49 


1 


0.50 


0.50 


0.07 


2 


35.77 


17.88 


2.39 


2 


2.27 


1.14 


0.15 


347* 


2593.15 


7.47 





• Uc)und ihc 1% Icvd. 

*lVo ob$ci vaiii>n$ arc miKStng; csiiiiiaUii by lubduu means. 
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that the experimenter remained neiitrnl. not giving positive eval- 
uations. Two other control children received the control treat- 
ment described earlier. It turned out that the gain in Phase III 
for these special control children did not differ from the gain 
for the usual control children, and that the gain for the experi- 
mental children .significantly exceeded the gain for both control 
groups. Since this more elaborate procedure leads to the same 
outcome, its continued use is unnecessary. It is a much more dif- 
ficult procedure to implement since it requires four cxperimenten 
for each group of four children whereas the less elaborate pro- 
cedure usi d in most of our ex|)criiiienis can deal with three grouiM 
of four children siiiiiiltiincoiisly with the same number of expcri- 
mciiiers. ' 

Discussion 

The results of this study extend the posijive findings of our 
earlier study and demonstrate that white middle class children 
respond to the experimental procedure by increasing the rate 
at which iliey raise their hands. Although there are other sub- 
groups tliai could be studied (for instance, black middle class) 
the range of children for whom the experimental procedure has 
produced increases in performance outputs suggests that the phe- 
nomenon probably is a general one. There are obvious similarities 
between the completion of a story under the experimenter's di- 
rection, the task in these experiments, and many tasks overseen 
by the teacher in the classroom. In at least a prelimhiary way, 
the scope of application of expectation theory now has been ex- 
tended to situations that frt(|ueiitly exist in natural settings: 
specifically, elementary education for children of various $ub- 
groii{>s. 

In leniis of ihe goal of iniKlifyiiig expectations in naturalis- 
tic settings. II IS iinpoitaiit lo point out h«)w the significance of 
social stains in ihis study dilfeis Irom its sigiiifiniiKe in other 
work relaied to expeciaii«»ii ihe«)ry. C.eiierally. studies of status 
characiensiu s and ex|jectaii«»ii slates conterii the effect of dif- 
fereiitinl siaiiis in assignniem of |M»wer and prestige within a 
small gnnip. 'I lms. Cohen |?)70) has studied the task pe.- 

forinaiice of mixed groups of black and while boys who attempt 
to solve a problem together, and Webster (1070^ has studied the 
effects of hiatus chararierisiics on the effectiveness of an evaluator 
In both Cohen's and Webster's studies, the significance of the 
status charartcristic is thai, under certain circumstances, differ- 
enttal conceptions of ability arise in accord with the different 
states of the diffuse status chamcterisiics. 

The focus «)f the present study differs. Here we have sub- 
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groups, all or whose members are equal with respect to a status 
characteristic (such as sexi «igc, and race) and our interest is 
whether the same experimental treatment will raise expectations 
equivalently for the various status groups represented in the study. 
Ihe intent, thus, is to look at the **demography** of expectation 
ability, particularly for naturally-occurring social status groups 
of kinds typically attending elementary school. Elementary schooiSi 
since thc^y draw from suvrotmding neighborhoods, and usually are 
not large tend to have students who are of about the same social 
class level. It appears that all children so far studied are respon* 
sive to this experimental treatment for modifying their expecta* 
tions; no signtficani differences by residential locus have appeared. 

Age level (over 4 grades) also w<is systematically varied and 
likewise may he viewed as a demographic variable. We do not 
find that the effect varies with age, but further study is neces* 
sary before age can be discounted. 

In the analyses reported here we have not observed signifi- 
cant effects of either the age/school grade factor or the rest- 
dence/race/SF.S factor. These negative flndings should not be 
interpreted to mean that tliese factors are unimportant, either in 
affecting the expectations children have for their own perform- 
ance or in attempts such as ours to raise these expectations. We 
might reasonably expect that the differential life experiences 
which are reflected in the facuirs of our analyses will produce 
differences in both the general level of performance expectations 
and in susceptibility to different sorts of treatments. Hoi«ever, 
what seems most important about the failure to find significant 
effects of these factors in our research is tliat, by comparison with 
the expcrimeulal procedure usvd to raise expectations, the social 
status differenc es are relatively unimportant. In other words, based 
upon the work so far completeci. it seems safe to conclude that the 
procedures developed for this situation are effective quite generally 
for children coming from very diverse social backgrounds. 

Relation to tkaciiek Kxi-KciANrv reskarcii. Results of 
teacher expeciancy studies have been disputed in some cases (see 
Barber and Silver, I0G8: Thorndike. 1969) and unequivocally 
negative in others (('laiborn, 1909: Jacobs. 1969: Fleming and 
Anttonen. 1970). Our experiments point to possible sources of 
difficulty in the teacher expectancy work. First, while most child- 
ren show slight increase in performance outputs even without 
much encouragement (the control groups), some individual child- 
ren fail to manifest increases in performance even with large 
amounts of positive evaluation. We have no rationale to explain 
this, since all expciiniental children were given the same treat- 
ment. Different experimenters, however, could produce different 
effects on children even when the actions and speech of an ex- 
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peritneiitcr are the same from child to child. One child, for ex» 
ample, might have icceivcd much praise in the recent past while 
another might he relatively deprived with respect to praise. We 
then would expect the latter child to be particularly receptive 
to praise by the experimenter: the same remarks would be fall* 
ing on two very different sets of ears. The more variability of 
this kind that there is among children, the more dilTicult it will 
be to demonstrate a significant effect. There probably are subtle 
kinds of teacher characteristics that decrease teachers' effectiveness 
as purveyors of expectancy effects, much as presumed subtle ef- 
fects may have operated to cause differences in our experimental 
treatment. 

Second, expectation theory assumes a task orientation on the 
part of students; it also assumes a task in which ability is equal 
or irrelevant. Doth assumptions may be violated often in the 
teacher expectancy, work, for many school children have little 
ego-involvement in academic pursuits and many already have 
firm ideas about their own supposedly low level of ability. Also, 
teachers' expectations are not the only expectations that are rele- 
vant. Children have some access to objective standards and to al- 
ternative others for evaluating their performances-parents, peers, 
principals— so the teachers' expectations for them may not be only 
one com|)onent in an overall set of expectation components. 

A third point made by Claiborn (1900) is that there may be no 
changes in teacher-pupil interaction, no classroom analogue of 
our phase II treatment. If teachers perceive pupils to be of high 
potential, teachers may alter their behavior, but not all teachers 
do so (sec Kranz, Weber, and Fishcll, 1970), and teachers vary in 
how they change their behavior to suit children's ability. For ex- 
ample, Kraii/, rt. al. (1070) show that some teachers behave sim- 
ilarly towards IiIkI) and average ability children but differently 
towards low ability children. Other teachers change their be- 
havior towards high ability children and manifest similar behav- 
iors towards average or low children. If, as in most teacher ex- 
pectancy experiments, a teacher is given false reports about 
students' abilities, she might or might not change her behaviors 
toward the designated children depending upon how her own 
behavior pattern is expressed. 

In our research, the effect has been largest for rural students 
and for suburban second graders, although for no case so far are 
these differences large enough to lead to statistically significant 
interaction effects. More work is needed to increase confidence that 
student subgroups are indeed as homogeneous as we have as- 
sumed in the present study. Among other things, there are rapid 
shifts in children's interests and capabilities over the grade-school 
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years. A task stiit.iblc for raising cxpcciaiions at one level may be 
inappropriate, or relatively iiiclfcctivc. at other levels. 

Some impUctttious of this ivork. As we noted earlier, the ex- 
pectations uliicli individuals have for their own performance can 
act as self fiilfiliing propliecies; people who expect to do well 
often will do better than those who expect to do less well. In 
addition to this general effect, there are at least two more specific 
ways in which increased expec tations may have important bene- 
ficial effects for children. 

The first is that the behaviors which are associated with level 
of expectation— that is. the observable manifestations of an in- 
divuhial's expectation level often are also used by teachers to 
assess performance of children. Moreover they may be related to 
the actual (iiiality of the child's activity in learning. For example, 
a child with high expectations is predicted by the theory to be 
likely to raise his hand often in class, to be more confident of his 
answers, and to be more willing to explore new or difficult subject 
areas. Since all of these behaviors typically are used by the teacher 
to assess performance of students, it seems reasonable to suppose 
that where other factors are equal, the higher the student's expec- 
tations for himself the more likely he is to receive favorable evalu- 
ations from the teacher, independent of any actual performance. 
Moreover, the behaviors associated with raised expectations are 
those associated with actual improvement of the learning situa- 
tion. C:iiil(liin who are willing to enter new areas, who select 
difTuiili piobleiiis instead of easy ones, and who participate fre- 
quently and .i< tively in < lass disi ussions probably actually do Icam 
more than (hose who do not. 

The .second effect of raised expectations, a motivational one. 
may be a long rim increase in the child's general level of self-con- 
ridence about nradeniic activities. This eventually could affect the 
sorts of college and career choices he will make. The higher the 
actual ability of other students in high school senior classes, the 
less likely is any given child to feel that he is performing in the 
top half of his class, independent of his actual ability or grades, 
(Meyer. 1970). Moreover, the independent effect of actual ability 
of others in determining likelihood of attending college in this 
same study was both negative and impressive: the gamma measure 
of partial association of others' ability and the college decision was 

What Meyer reports is related to the relative expectation of 
students and suggests how the structure of competition may aid 
or hinder performance. Competition of some kinds may be a 
powerful liuentive and lead to high levels of performance. Coir.- 
petition of orher kinds may be debilitating. Competition can be 
employed rationally as an incentive for learning, but under pre- 
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sent conditions it is hard to know when or how. Being a member 
of a school class where other members have uniformly high ability 
probably lowers the individual's level of expectation for himself 
(indcpeudeut of his actual ability)^ and the lowered expectation 
level for self iu turn decreases the likelihood of attending college 
by a significant amount (again» independent of the actual ability 
of the child). Kxteuding this theory^ the higher a child's level of 
expectation for himself, the more likely he is to attend college, 
again, independent of his actual ability* If this proposition is true, 
then the effect of increasing children's expectations, by itself, 
should have measurable and beneficial results. However, it is im- 
possible to know just how much treatment, and of what sort, 
would be necessary to produce the expectation changes indicated 
in this analysis. 

A similar effect has been described by Davis HOGG) as the 
"frog pond" effect. Among a group of college students of uni- 
formly high ability (National Merit winners, finalists, and semi- 
finalists), the higher the general ability level of the other college 
students, the less likely was the individual to choose to enter a 
''high-performance major field" (as opposed to vocational and "ad- 
justment" majors). These results are consistent with those re- 
ported by Meyer, and it is reasonable to interpret them similarly 
in terms of expectation theory. The effect of having as significant 
others persons of uniformly high ability is likely to be the lower- 
ing of the individual's self-expectations and the consequent with- 
drawal from "tough" competition. People tend to avoid, when 
{X)ssible, activities in which they feel they will show up poorly. 
The businessman who is poor at golf meets his associates in the 
pool or chihhouNC. l\y extension, if the expectations of a given 
nuiividual could be raised, he would be more likely to enter the 
more clKiHengiii^ fields. Davis' results are es|>ei'ially interesting 
for, in view of lii,s s;unple. ii seems reasonable to conclude that 
any .studein had suflicieni ability to enter a "high performance" 
fiehl; the actual diirereiia-s in entry may be ascribed to differences 
in expectation for success. 

At this time, we simply do not have sufficient knowledge 
from which to derive effective procedures for producing the gen- 
eralized and enduring changes in expectations discussed here. 
However, we may note at least that these studies are consistent 
with predictions from expectation theory and that the conse- 
quences described are of major significance in the lives of the 
individuals studied. 
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CHAPTER SEVEN 



Raising Children's 
Expectations for 
Their Own Performance: 
A Classroom Application* 

DORIS R. ENTWISLE 
MURRAY WEBSTER, JR. 



In this chapter we arc concerned primarily with consequences of ex- 
pectations: that is, with examining the behavioral eflects prciuced by the 
expectation states actors already hold for each other. We are interested both 
in specifying more completely the types of effects that may result from these 
expectation states and in enumerating more of the empirical inteipretations of 
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fpecificd consequences. To state the distinction somewhat difTcrcntly, in 
preceding chapters one of the main interests has been specifying processes 
that would generate a particular pattern of expectations; in this chapter, a 
particular pattern of expectations is taken as the independent variable, and 
the interest is in specifying additional consequences, dependent variables, 
that will be produced by the expectations. 

A second way in which the work reported in this chapter differs from that of 
previous chapters is that the interest here is primarily in application of the 
theory to empirical situations, rather than in theory development. Thus for 
this work we take propositions of Expectation States Theor)- that have 
received some verification in the highly controlled environment of the social 
psychological laboratory and attempt to use them to produce desired results 
in a naturalistic situation. This work may help reduce the gap between the 
precise but artificial laboratory setting, and the imprecise but naturalistic 
classroom setting in which educational research is often conducted. 

Earlier chapters in this book have dealt primarily with problems of elabo- 
rating interactional and structural determinants of performance expectations; 
that is, with extending the basic propositions of expectation theory so as to 
predict the specific performance expectations formed from various combina- 
tions of initial status relations, agreement or disagreement interactions, unit 
evaluations of performances, or activating or making salient status charac- 
teristics. Most versions of the theory presented so far have included a state- 
ment of what has been called the "basic expectation assumption": the 
structure of expectations held by the group members will determine the distri- 
bution of the components of the obscrvuble power and prestige structure in 
the group. In chapter five, the source version of the theory was extended to 
include explication of the concept of a source of expectations, an actor whose 
evaluations arc accepted and are used by others us the basis of the expectations 
they hold for their own and each others' performances. The basic expectation 
assumptions— Assumptions 3 and 4 in that chapter— were not problematic in 
this task, for they were assumed to have received adequate confirmation in 
earlier research. 

In most (although not all) of the theory development research reported 
earlier the effect of expectation states produced by the various independent 
variables has been assessed using the single consequence rejection of influence 
under conditions of disagreement. For purposes of developing the theory by 
specifying additional determinants of expectations, this uniform measurement 
operation is an advantage, for it <rf"ien permits direct comparisons of results 
across experiments. The basic expectation assumptions, however, actually 
predict a variety of behaviors (including rejection of influence) that result from 
expectatioii states, and these are important components of interaction as well. 
These other components of interaction arc important both for explaining 
frequently reported results and for identifying variables that arc important to 
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processes outside the immediate group participating in an experiment. For 
example, likelihood of acceptance of an action opportunity, a component of 
observable interaction that is predicted to vary according to the level of 
expectations held, is important not only as a component of interaction in a 
given group, but additionally because it is related to such other problems as 
learning in the classroom, assertiveness, and the impression of competence 
given to teachers and to other students. 

Several investigators working with grade school children have reported 
results that in a general way are related to the variables of Expectation States 
Theory research. Rosenthal and Jacobson (1968), for example, report that 
when teachers are told that some randomly selected children were ''potential 
academic bloomers," the selected children sometimes showed gains in aca- 
demic achievement, both by teachers' ratings and by more objective measures 
such as intelligence test scores. Meichenbaum, Bowers, and Ross (1969) 
report similar results, and also show that teachers' behaviors toward the 
selected children differ from their behavior toward unselected children. By 
contrast, other investigators have reported clearly negative results from 
similar attempts to produce these effects (Ciaiborn, 1969; Jacobs, 1969). 
Using similar procedures in similar settings, they have been unable to produce 
similar effects. In this research, often called the "teacher expectancy" field, 
there are both successful instances and unsuccessful instances. This is per- 
plexing. But even more perplexing from our viewpoint is that no matter what 
the outcome, there is no single, clear, convincing rationale to support the 
choice of techniques, the selection of the sample, the success of an experi- 
mental maneuver, or. in the cases where it applies, the failure to observe 
predicted effects. 

The classroom research in expectancy concerns important phenomena, 
both in practical terms of children's learning and in theoretical terms of 
improving our understanding of educational institutions and the learning 
process, even though the findings have been sometimes inconsistent and con- 
tradictory. A major cause of the inconsistent results, we feel, is the lack of a 
sociological or social-psychological context. To our knowledge, work in the 
"teacher expectancy" field has not been guided in any systematic way by a 
theory that explicitly specifics the determinants, the definition, or the con- 
sequences of the "expectancies." We feel that Expectation States Theory, with 
appropriate interpretation, can specify the determinants, the definition, and 
the consequences of "expectancies," in this new research area. 

The work reported in this chapter is intended to be a direct application of 
Expectation States Theory to problems of interest in educational research. 
Many ordinary classroom interaction situations meet the (ask orientation and 
collective orientation conditions of Expectation States Theory. Also, many of 
the "observable components of the power and prestige structure" specified in 
the basic cxpcciaiiun assumptions are similar to variables studied by educators. 
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Allocation of action opportunities, for example, can be seen when a teacher 
calls upon students. 



Studies in teacher expectancy research usually include the following features 
First the investigator describes his research to participating teachers as being 
involved with academic achievement of students and with predicting and 
assessmg achievement. Students arc tested, using one or more of the common 
standardized psychological tests. Test scores of students are not revealed to 
the teachers. At some later time, usually after a few days, the teacher is told 
that some students (typically around 20 percent of the class) have unusual 
academic rotential. and ihut the teacher will probably observe unusual inteU 
lectual growth in those students during the coming months. The selected 
students are picked either at random, or in ways independent of their actual 
scores at the testing. Several months later, students are rctested. In a recent 
reassessment of this research (Rosenthal and Rubin. 1971) about 39 percent 
of such studies show positive results: selected students show greater gains on 
retesl than their unselecied classmates (see, for example. Rosenthal and 
Jacobson. 1968; Mcichenbaum et al.. 1969). Positive findings have also been 
noted when the objective tests used to measure expectancy eflects were admin- 
istered by the schools rather than by experimenters. 

Expectation States Theory suggests the following chain of events. The 
researcher emers a social institution where ability and evaluations of ability 
are cemral, and he presents himself as a capable judge of ability. The initial 
testing tends to legitimate his claims. The researcher, by stating that certain 
students possess unusually high academic potential, gives teachers information 
that should raise the expectations they hold for the chosen students 

If the teacher's expectations for a given student are raised, the basic expec- 
tation assumption predicts teacher behavior will be altered. The student will 
receive more action opportunities than other students, or than he received 
prior to the manipulation; for example, he will be called on more often in 
class. More importantly, this student will be more likely to receive positive 
evaluations from the teacher for any given performance output; that is 
whatever the student says in class will be more apt to be positively evaluated 
This student will be more likely to have his opinion agreed with by the teacher 
and by other students, especially in cases where the teacher and others have 
not yet decided upon the "correct" answer. 

The effect upon a student's expectations of raising the teachers expecta- 
lions for him may also be predicted from the theory: his expectations for his 
own performance should be raised. The theory sees one of the immediate 
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determinants of any individuaPs expectations as the positive and negative 
unit evaluations of individual problem-solving attempts. 

This analysis of the changes in teacher beha\ior points to selected students* 
receiving more positive evaluations, and as a consequence, coming to hold 
higher expectations for their own performance. Then according to the basic 
expectation assumption, the selected students will be more likely to emit per- 
formances. They will raise their hands more, and will speak out more in class. 
They will also be more likely to think their ideas are good ones, that their 
solutions to problems are the correct ones— in general they will be more 
self-confident and less likely to accept influence from others when their opin- 
ions are disagreed with. The process is circular, once set in motion, for these 
changes in students' behavior are of precisely the sort that are likely to lead 
teachers, even teachers who have forgotten the experimenters initial revela- 
tions about test scores or other teachers who did not receive the score infor- 
mation, to convey high expectations. 

Successful teacher expectancy studies are simply explained: students' 
higher grades on both standardized tests and classroom tests are the conse- 
quences of the expectation raising manipulation triggering the circular pro- 
cess described above. Higher scores on objective tests, for example, could 
stem from an improved mobilization of the student's resources in his now 
more responsive environment, from actual improved learning by greater class 
participation and involvement, or even from added increments of extra-school 
learning because improved self-confidence and positive expectations general- 
ized outside school First of ail, learning should be improved by the incicased 
interaction with the teacher and with other students, which would be directly 
predicted from the basic expectation assumption. Second, the behaviors 
associated with holding high self-expectations are probably important motiva- 
tional amplifier*^, perhaps leading to increased attention, curiosity, self- 
confidence, and iiiicrest in independent study. 

The processes involved in the teacher expectancy research are important, 
both in practical terms and in terms of application of formal Expectation 
States Theory. We therefore designed several classroom studies in which the 
work to be reported was guided by two interrelated goals: (t) to apply some 
of the basic terms and assumptions of Expectation States 1 heory as a tool for 
analyzing a naturalistic situation, and (2) to use the theory to suggest simple 
procedures and tasks that will produce useful results in the field. 

II. APPLICATION OF THEORY AND TASK DEVELOPMENT 

r^. guiding the tield studies in expectation raising, we decided to apply a 
version of lixpoctalion Slates Theory that had previously received laboratory 
test and support. Wc are concerned here with ability and evaluations of 
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ability, expccially as they arc aflfcctcd by opinions of a ^^significant other** 
such as a teacher or a parent. Therefore, it seemed appropriate to adapt the 
single-source version of the source theory developed in chapter five. For 
reference, the explicit definitions and assumptions are presented in the appen- 
dix to this chapter 

All versions of Expectation States Theory assume the initial conditions of 
task orientation (interest in solving some problem) and collective orientation 
(willingness to consider answers from any individual in the group). These 
conditions are frequently met in classroom interaction: by definition, much 
learning activity is directed towards problem solution, and except for written 
testing situations, ideas and advice from many individuals are actively sought. 
The source theory speaks of a source of evaluations^an individual accepted 
as more competent to evaluate performances than the subject of interest, p. 
In the classroom, the teacher fills this role, both by virtue of his or her greater 
knowledge, and, usually, by access to objective information such as an answer 
key. For our experiments, we decided to fix acceptance of the experimenter as 
a source; we /o/^ children that we were competent to evaluate their pcrfor* 
mances at the task to be described below (information that they were willing 
to believe). 

Given an accepted source. Assumptions I and 2 lead to the derivation that 
if the source (e) evaluates a series of performances by any individual (/'),/' will 
come to believe that his ability is consistent with the evaluations received*. 
High ability conception (or self-expectation state) is the direct consequence of 
receiving positive evaluations from a source. Adding Assumption 4a enables 
us to predict an observable consequence of the expectation state p comes to 
hold; the higher the self-expectation state, the more likely is he to accept an 
action opportunity and to make a performance output. 

Because we are interested in improving children's self-conceptions, we 
decided to attempt only to raise expectation states, not to lower them. (The 
latter would be desirable for theory testing purposes, not for our goal of 
practical application.) Thus, our experimental design calls for giving heavy 
doses of positive unit evaluations to selected children and predicting that this 
will raise their self-expectation states, and, consequently, increase their like- 
lihoods of accepting action opportunities. In the classroom, teachers distri- 
bute action opportunities to the entire class when they ask something like 
"^Who knows the answer to this question ?'* Children who think they know the 
answer— that is, children who hold high self-expectations for that task— accept 
the action opportunity by raising their hands or speaking out. Our experi- 
ment, described in detail below, was designed to be analogous to this sequence 
of behaviors. In general, we predict that we can increase the rate ut which a 
child raises his hand in response to group-directed questions us a direct func- 
tion of the scirc.Npectations that iic holds, and that we can increase the child's 
self-ex pcclation state by giving him a large number of positi\c unitcvaluations 
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of past performances. More precisely, we formulate for testing the following 
two derivations, using Assumptions 1, 2, and 4a from the single source 
theory: 

Derivation 1 

If an individual (p) has received no unit evaluations of his performances at 
time /| and receives a large proportion of positive unit evaluations from a 
source at time /a, then the likelihood that p will accept a given action op- 
portunity and make a performance output will be greater at time /} than at 

Derivation 2 

If an individual (pi) has received a large proportion of positive evaluations 
at time /a, then as compared to a second individual f pj) who has not received 
any performance evaluations, the likelihood at /| that Pi will accept a given 
action opportunity and make a performance output is greater than the like- 
lihood that Pa will do so. 

A large part of our initial work was devoted to development of an appro* 
priate experimental task that had to meet different requirements from those 
met by previous laboratory tasks. Laboratory studies usually alter expecta- 
tions by giving subjects false information. For example, subjects are given 
fictitious test scores or are led to believe that other subjects are disagreeing 
with them by means of apparatus that alters communications. All previous 
laboratory work and all teacher*expectancy studies mentioned earlier rely 
heavily upon the use of false information to alter expectations. 

Using false information may be very effective and consequently useful for 
research purposes (especially in laboratory studies where the deception 
involved may be explained immediately afterward), but for repeated use in 
applied research false information is clearly not desirable. In studies where 
expectation states are raised, there are both practical and moral difficulties 
associated with the continued use of false information. For example, one 
would not wish to tell a child that he is bound to do much better than he has 
been because if the prediction fails he may suffer an impairccf self-image. As 
for giving teachers false information, the Rosenthal and Jacobson studies 
have already received sufficient notice in the press so that teachers and princi- 
pals nod smilingly if an investigation reports "new high test scores.** In 
studies where expectation slates are relatively lowered, as in some tracking 
studies, there are moral difficulties. No one would care to tell a child that he is 
likely to fail in the next semester or that he will not do as well as he hopes. 
Research in a naturalistic setting thus demands modification of the main 
experimental maneuver used in the previous laboratory work with the need to 
alter expectation states in some way of her than the gi\ ing of false information. 

After some pilot studies we decided upon the prov'cdure described in what 
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follows, a modification of a story-telling task previously used in research on 
cognitive development of children (Entwisle, Grafstein, Kervin, and Rivkin, 
1970). One adult interacting with one child gives consistent positive evalua- 
tions of perforinance in a task where the child's actual ability is almost irrele- 
vant. The evaluations are therefore not inconsistent with anything known 
about the child or with his potential. The aim of the maneuver is to raise the 
child's expectations. In addition to meeting the major criteria listed above, the 
pilot studies demonstrated that the task met practical criteria: (I) it provided 
discrete, easily observable performance outputs that permit clear evaluations, 
and (2) it was interesting enough to capture children's attention. 

The experiment has three phases, corresponding to the three "times" 
referred to in the derivations. Phase I determines the "baseline" level of accept- 
ance of action opportunities for both the "experimental" and the "control" 
groups. Children fill in words in a story skeleton and the experimenter is 
neutral. The experimenter notes who raises his hand as each word is called 
for. (The experimental and control groups correspond to individuals p, and 
/>„ respectively, in Derivation 2.) In Phase II, the attempt is made to mani- 
pulate upwards the self-expectations of children in the experimental group. 
One child (experimental group child) fills in words in a second story skeleton 
and is praised and encouraged by the experimenter after every word. The 
other children (control group) participate in a neutral procedure. Then the 
group of children (experimental and controls) is reassembled for Phase III 
and fills in a third story skeleton, with the experimenter again neutral and 
noting who raises hands. 

To tie this back to the theory: the validity of Derivation I can be assessed 
by comparing the results of Phase 1 to Phase III for children in the experi- 
mental group. How well Derivation 2 is confirmed can be assessed by com- 
paring results of Phase III for children in the experimental group with results 
of Phase III for children in the control group. 

The basic experiment just described serves as the foundation for all empir- 
ical studies to be reported in this chapter, although modifications are intro- 
duced where necessary for purposes of studying the effects of new variables. 
Experiments have been classified into three "series," according to chronology 
and according to issues that emerged during the course of the research. 

For the First Experimental Series, work is concerned with testing the task 
and experiment designed: would they work in the way we expected on the 
basis of the theory and laboratory studies? Would the children understand 
and be willing to participate in the research? Is the procedure simple enough 
that we could show others who have little interest in abstract theory or experi- 
mental design how to raise expectations? In order to gain information on 
these and other basic questions, we applied the procedure to as wide a range 
of subjects (white rural children; black inner city children) as initially pos- 
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founh gJJJcT '"P'ctotion development (third and 

The Second Experimental Scries was conducted to expand downwards the 
white middle class children were the subject sample. This extension permits 

suTus' race T *° «««« relatireff^ro 

stotus, race, age, and sex factors upon the processes. 

ihlilrl^'^. Experimental Series is addressed to "special problem" issues 

Ind chn? K '^'Z^!"''^'^ '' of mismatch between source 

?tl th; tmr« ?rf. self-expectation 
s ates. the contrast effect" in our expectation raising experimentVand an 
ahernative "behavioral modification" interpretation 'of Tur ex^Viment^^^ 



III. FIRST EXPERIMENTAL SERIES 
A. SUBJECTS 

L7..II;«/'"' "1 f'O'n fo"' schools in the 

Baltimore area, t^o inner city schools with nearly 100 percent black students, 
and two rural schools w.th nearly 100 pcrcem white students. The rural 
Penntlv^niaTnc ^° """" Baltimore, near the 

. in* ' u"*' '''^ n'**""'"' ^hose school records showed 

tested IQ scores between 90 and 1 10 were selected for the study. The IQ ranse 
for rural students ,s considerably larger, from 76 to 141, but the mean IQ is 

wer/midSL '"^ ^^^''^ Experimenters 

were middle claw white persons in the 20-30 age range, both sexes. 

B. PROCEDURE 

At the beginning of an experimental session, children were brought together 
and told that the researchers were looking for people who vould tell good 

?1 ; J '° ^'"'^'^ '"'^ »oW that the team that 

made the best stones would win a prize. Then one experimenter took the 
rncmbers of each team to a separate room and described the storytelling t.isk 

J^n^Tn- r f ' '^'^^ 0" on« team came 

from dilTcrcnt claisrooms. 

Children were told that the "game" consisted of making up a story The 
experimenter would help by starting sentences, but then the children should 
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try to make interesting stories by supplying '^good"* words when asked for 
words for the story. In every phase the same story skeleton (see Figure 7-1) 
was used, and the skeleton contained twelve blanks. Children filled these 
blanks with twelve words or phrases. 



THERE WAS ONCE A VERY TALL PRINCE WHO HAD A (casile) 
THAT (HE, SHE) (lived in) 

ONE DAY (HE, SHE) HAD TO GO TO (the dungton to ttf his prisoners) 
(HE, SHE) DID THIS VERY (angrily) 

BECAUSE (HE, SHE) WANTED TO (make sure they were there) 
THIS WAS VERY DANGEROUS BECAUSE OF THE (strong prisoners) 
WHO (WHICH) WAS (WERE) VERY (mean) 

IN ORDER TO FOOL THE (FILL IN) THE (FILL IN) DRESSED UP AS 

(another prisoner) 
IN SUCH A DISGUISE THE (FILL IN) LOOKED (mean) 
And when the (fill IN) saw the (fill IN), they (welcomed him) 
THIS MADE THE (FILL IN) (feel pretty good) 
AND /COMPLETE STORY/ (he let his new friends go). 



The txparimentor chouses from atlernativfls m paronihescs th« iiem consistent with the story line. Foi ex- 
ample, in this story the pronoun "he" is chOMn because it refers lo "pnnce " 

FIGURE 7*1 

Story Skeleion with Sample Entries from a Rural Group 

Children were told to listen carefully while the sentence was being read, 
then when the blank was reached, to try to think o( a good word. Anyone who 
thought of a good word was to raise his hand, and the experimenter would 
select one child to give the "team*s word" for that sentence. Children were 
cautioned not to raise their hands unless they thought they had a good word, 
for if they were called on and gave a bad word, this would hurt the team^s 
score. The purpose of this instruction was to help maintain the task orienta- 
tion and collective orientation required by the scope conditions of the theory. 
The experimenter allowed 30 seconds to elapse after reading a sentence before 
calling upon a child. 

Before calling on a child, the experimenter recorded privately which chil- 
dren were holding up their hands (the measure used to determine the expecta* 
tion state —acceptance of an action opportunity). The experimenter held a 
clipboard so that children could not see what was on it. The clipboard was 
used for recording words given by the children. During the 30-sccond waiting 
period, he made small marks indicating w hich children had volunteered with* 
out the students being aware of this. 
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Phase I consists of an initial story being produced as just described. The 
cxperimenier (E) does not evaluate any of the words given during Phase I. He 
calls upon each child in Phase I approximately the same number of times. 
With twelve blanks to fill in and with four children playing, each child can be 
called upon three times. In only a few instances did any child volunteer fewer 
than three times altogether and so unbalance the selection of respondents. 

At the end of Phase I children are asked to return to the room where they 
initially assembled. After they begin to move out, E quietly tells one child 
(chosen because his level of hand-raising was near the median for the group) 
to stay in the experimental room and wait there a minute. After E makes sure 
that the control children are on their way to the proper destination, he returns 
to the room and to the selected (experimental group) child. E then tells the 
experimental child that he will have an opportunity to make up a story all by 
himself; also that E has played with many children making up stories and 
that C thinks he. she (ihc experimental child) is really good at the task. Then a 
story skeleton with a new lead word is filled in orally by the child, just as 
before, but of course with only one child producing the story there is no 
volunteering. The child merely supplies a word for the blank when the sen- 
tence is read by E. After each blank is filled. E indicates approval vigorously 
—•by smiling, by nodding, by commenting "a very good word"—"good"-- 
"that's interesting!", etc.. that is. he indicates approval in every possible way 
consistent with sincerity. 

Several experimental groups (four or five) were run simultaneously with an 
E for each group. A single coordinator for all the groups managed the initial 
explanation, story-reading for the Phase II control groups, and prize award 
sessions. When Phase II for the experimental child was complete, he went and 
joined the control children in their Phase 11. Phase II for control group chil- 
dren is a story-reading session, and control children from several game 
groups pather in a central room as they fin*:h Phase I to listen to the coor- 
dinator read a story. Children from the several experimental groups join the 
story-reading group (control groups. Phase II) as they finish their Phase If 
acti\itiei The experimental children thus listen to the end of the story being 
read lo the control groups. The experimental children's entrance is not notice- 
able bccauic they join the story- rc.iding group while the children's attention is 
directed toward the story teller. The story-reading prevents communication 
among the children during Phase II. At the end of the story-reading the 
children arc asked to "go back to the room where you were before." /<// chil- 
dren thus return loiicihcr. At this point E s are rotated among rooms so each 
t has a new group and is unaware of the identity of the e\,)erimental child. 
Phase 1 1 1 consisti of a repetition of Pliase I with the experimenter noting ho;v 
mjn> limes each child \oluniccrs lo supply a word. The length of each phase 
varies, as would be c\pected. but the Phase II control procedure can adjust 
its Icneih lo ihe lime requiremcnb of each set of e.Npcrimcnis. 
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At the conclusion of Phase III, the children are again brought together in a 
large group. One team is selected at random as having constructed the best 
team stories, and each member of the winning team is given a pr'ut of a 
regular size candy bar. All other children are given a miniature candy bar, 
thanked Tor their help, and escorted back to their classrooms. Figure 7-2 
summarizes the experimental design for all this research. 

. Results 

« 

Table 7-1 presents the gain in the number of times children in the control 
groups and in the experimental groups volunteered by raising their hands 
from Phase I to Phase III. (The mean number of times children raise hands in 
all groups is about 6.7 in Phase I.) A /-test of the mean gain of the experi- 
mental groups vs. the mean gain of the control groups is highly significant 
(^('no 3.17) < 0.01). Other /-values arc displayed in the table Tor various 
subdivisions of the sample. It is noteworthy that in each stratum the gain Tor 
the experimental group exceeds the gain Tor the control group, even though 
not every individual comparison is statistically significant on /-test. 

TABLE 7*1. Gains in Rate of Volunteering from Phase 1 to Phase ill. 
(A^'s in parentheses) 

Probability level. 
One-sided /-test or 
OifTerences between 



Experimental Experimental and 

Subjects Control Subjects Control Groups 



All Black 


(36) 


+ 1.22 


(108) 


+0.64 


N.S. 


3rd Grade 


(20) 


•M.8S 


(60) 


+0.88 


N.S. 


4lh Grade 


(16) 


+0.44 


(48) 


+0.33 


N.S. 


All While 


(32) 


+2.28 


(96) 


+0.36 


0.001 


3rd Grade 


(22) 


+2.45 


(66) 


+0.S6 


0.001 


4th Grade 


(10) 


+ 1.90 


(30) 


-0.07 


0.02S 


Grand Total 


(68) 


+ 1.72 


(204) 


+O.SI 


0.001 



The success of the story-telling task may also be assessed by noting the 
proportion orchi'drcn in the experimental groups \\hose rate of hand-raising 
increased Trom Phase I to Phase 111 compared to the proportion in the control 
group (Table 7-2). 

The /-tests assess the magnitude of the gain but this increase could come 
about because some children's rates increased markedly even though others 
did not. My examining the proportion of children who gain, one has informa- 
tion on the consistency of the effect. Proportions i rhe experimental groups 
consistently exceed proportions in the control grou^^^. The overall proportion 
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TABLE 7^2, Pcrcontago of Persons Showing Gains from Phase I to Phase III. 



Experimental Subjects Control Subjects 



All Black 


68 


53 


3rd Grade 


78 


68 


4th Grade 


56 


33 


All White 


86 


55 


3rd Grade 


89 


61 


4th Grade 


80> 


43 


Grand Total 


76 


54 



of children gaining in the experimental groups is 76 percent, compared to 
54 percent, just about chance level, in the control groups. 

Thus the predictions of Derivations I and 2 are in general borne out by the 
results of the research. The data of Tables 7-1 and 7-2 show that the proce- 
dure apparently produced increases in expectation states, as measured by 
acceoiance of action opportunities, although success was much greater with 
'-cn than with black children. 

C. ASSESSMENT OF fIRST EXPERIMENTAL SERIES 

In the First Experimental Series. 76 percent of the childhen who received the 
experimental treatment showed the desired increase, a significant gain when 
compared with that made by control students. Operationally the experimen- 
tal procedure was a success. From a practical standpoint, the procedures 
were simple and easy to use. Field notes indicate that the children found the 
task involving and enjoyable and were highly motivated to succeed at it. 

Variability in gains resulting from using this experimental procedure 
deserves some comment. (The black children have already been commented 
upon.) As a child grows older, his expectations for his general performance 
level at most tasks probably crystallize. Thus a greater increase for third 
graders than for fourth graders in both the control and experimental groups 
is not surprising. In fact such considerations led us to select third and fourth 
graders initially rather than older children who might have been preferable as 
subjects on other grounds. It also led to our extending the age range down- 
ward as reported in the next section (Second Experimental Seiies). 

As mentioned earlier, maneuvers that give fictitious results to students or 
to teachers have drawbacks. Such maneuvers may be defended when they are 
one-shot procedures to demonstrate self-fulfilli.ig prophecies, placebo efTecls, 
and the like, or when they are needed to allow quick and sizeable manipula' 
lion of variables in the laboratory. But such maneuvers are not defensible 
over the long term. They are not even viable when the goal is to change chil- 
dren's long-range expectations for themselves or others' expectations for 
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children in classroom settings. If one goal of education is expectation altera- 
tion, then of altering expectations must be found that are compatible 
with other educational goals and that arc suitable over the long term. The 
procedure described here has achieved modest success along these lines. 

Besides the practical and educational criteria considered above, one must 
also evaluate the procedure in terms of its ability to opcrationalize the vari- 
ables of the theory. Probably both "expcrimemer effects" and unintended 
biasing arise. Third, and perhaps most important, the question of alternative 
interpretations may be raised. Does Expectation States Theory provide the 
best context for interpreting the present results? We will now analyze these 
three questions in turn, and in the section on "Special Control" experiments 
we will reconsider them in the light of additional data. 

(1) Experimenier effects were equalized insofar as possible. First, E's were 
trained to treat the children as equally as possible in Phase I and to refrain 
from evaluating any performances. The child selected as the "experimental" 
child at the end of Phase I was chosen on the basis of his having responded 
dose to the median for his group. To choose a child with too iow a response 
rate in Phase I might have biased results in favor of predictions through a 
"regression to the mean" phenomenon in Phase III. To choose a child with 
too high a response rate in Phase I would have biased results against predic- 
tions, because of a "ceiling effect." 

To avoid drawing attention to the child selected to participate in Phase II, 
E did not say anything to that child until the children were on their way to the 
other room for Phase 11; nor did the E explain at all to the others why this 
child was asked to remain behind. In most cases the fact that one child 
remained behind was apparently not noticed. 

In addition, the control children were occupied at similar tasks in Phase II. 
All children w en with an experimenter during Phase II. because control chil- 
dren were with the research coordinator in a sto.y-reading session. The 
experimental group children participated in at least part of the control treat- 
ment because they joined the story-reading group as they finished the Phase 
JI experimental treatment. 

(2) The issue of biasing is troublesome. Tor even a slight change in a teacher's 
manner will change a child's disposition to raise his hand. Therefore, a num- 
ber of steps wen regularly taken to minimize bias from E. The major danger 
of biasing would come in Phase III. however, when E might respond more 
warmly or more positively to the "experimental child" than to the others. 
This child might also feel that because of the individual session with E in 
Phase II he (the child) had some "special" relationship with E. In order to 
minimi/e problems of this nature. E's were rotated before the beginning of 
Phase III. Thus the E was new to thp entire group in Phase III, unfamiliar to 
both the experimcnial and control children. Furthermore, the E did not know 
al this point uiiich child itad been given the Phase II "treatment" and thus was 
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unlikely to treat the children diflcrcntially. In Phase III, as In Phase I, E's are 
trained to call on every child the same number of times insofar as this is 
possible. 

(3) Alternative interpretations of experimental results are relatively easy to 
invent. Certainly we cannot rule out all competing interpretations for the 
results reported here. It seems difficult, however, to contrive an explanation 
that accounts for all our results, as well as for work by others mentioned 
earlier, as satisfactorily as Expectation States Theory. Suppose, for example, 
the effect of the treatment of the experimental group child in Phase II was to 
ofier additional practice in story telling, and that practice per increased the 
child's confidence in his ability to tell stories. This would imply that the 
strongly positive Phase 11 evaluations were unimportant, and possibly that 
the level of expectations in Phase 111 was irrelevant to behavior (hand raising) 
as well. But the task was selected partly because it calls for no special ability; 
it certainly is not one that can be "improved*" through practice. Without 
evaluations from E, children ha\e trouble deciding whether their words are 
"good ones": children's remarks in the "unevaluated" Phase I point to this. 
So practice as an explanation is not appealing. 

The results, by way of another example, might be accounted for by seeing 
the Phase II procedure as reinforcement of behavior rather than positive 
evaluation. The experiment might be seen as demonstrating that children will 
increase the rate of emission of behaviors that have been rewarded in the past. 
In assessing the reinforcement explanation, the reader should note carefully 
that our measure of expectation states in Phase I and Phase III was rate of 
volunteering, or hand raising. What were positively evaluated in Phase II were 
words spoken by the children. At no time during the experiment was anyone 
positively evaluated (or reinforced) for hand raising, the measured behavior. 
Positive evaluation (or reinforcement) was never given for hand raising 
because during the Phase II experimental treatment the child did not raise his 
hand. In the other phases when hand raising was occurring no evaluation (or 
reinforcement) was given. Furthermore, the specific words evaluated in Phase 
II were seldom the words volunteered in Phase III. Thus, a reinforcement 
explanation lacks force on analytic grounds; it requires making several 
tenuous interpretations of experimental variables, and making some rather 
complex assumptions about stimulus and response generalization. 

Other research that explicitly measures children's expectations by using a 
kind of self-rating scale is consistent with our results. Expectations are seen to 
increase following approval und positive evaluation by an adult, but are 
unchanged when an adult maintains a neutral role (Hill and Dusek, 1969; 
Crandall, 1963; Cr.indall, Good and Crandall. 1964). The actual procedure 
used by Hill and Dusck was \cr> similar tu our own. for an adult responded 
"That's good. rinc. Very g >od. You're doing well" for positive evaluations 
following .ittcmptN at an angle-matching task. The adult was neutral and 
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nonrcsponsivc for the noncvaluative condition (like our Phase I and Phase 
Hi procedure). Other studies, also consistent with the conclusion, show that 
if an individual gets approving reactions from others with respect to some 
specified attribute, he will improve his self-rating on that attribute (Maehr, 
Mensing. and Nafzger, 1962; Videbeck, I960). 



IV. SECOND EXPERIMENTAL SERIES 

The overview of the first set of classroom studies is encouraging, both in terms 
of procedure development and theory application. The Second Experimental 
Series aimed to extend the age range and dealt with middle class children. We 
wished to determine the age limits within which experiments would be robust 
and wished to see whether sex and race subgroups responded diflTerentially to 
the procedure (hinted at by the apparently lower susceptibility of black 
children). Also, of course, further experiments provide replication of earlier 
work. 

A. SUBJECTS 

The experimciitui procedure ulready described was used with a sample of 
children from a uhiie middle class suburb of Baltimore. There were approxi- 
mately equal numbers of bo>s and girls, with 79 first graders, 84 second 
graders, i 12 third graders, and 103 fourth graders. Insofar as possible, the 
four members of each experimental group were chosen from different class- 
rooms. Some of the grades were distributed among four classrooms, some 
among only three. In the latter case, two children were taken from a single 
classroom, and the remaining two from two other classrooms. 

B. RESULTS 

For this and later experiments, we now change the method of reporting 
results and report results of anal>ses of variance. With a factorial design 
where treatment (experimental vs. control), grade, and sometimes residential 
locus arc considered to be three fixed-effect factors, the data can be revised to 
yield proportional subclass numbers by randomly discarding some experi- 
mental groups. One second-grade group, eight third-grade groups and six 
fourth-griuic groups were therefore eliminated using a random process before 
analyzing the data shown in Tabic 7-3. 
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TABLE 7-3. Average Gain in Rate of Volunteering From Phase I to Phase III, While 
Middle Class Students {N's in parentheses) 





1 


2 


3 


4 


Av. 


Experimental Croup 


1.20 


2.SS 


1.83 


1.85 


1.86 


(20) 


(20) 


(20) 


(20) 


(80) 


Control Group 


0.67* 


0.67 


1.20 


1.38* 


0.98 


(59) 


(60) 


(60) 


(59) 


(238) 


Difference Between Groups 


0.S3 


1.88 


0.65 


0.37 


0.88. 



*Thit mtan it biMd on 59 rather than 60 obtervationl. 



Tabic 7-3 shows the mean gain in rate of volunteering from Phase I to 
Phase III by grade and sex for treated (experimental group) and untreated 
(control group) middle class children. Groups are approximately balanced 
for sex. Three sorts of changes in the dependent variable are reflected in this 
table. First, all experimental and control groups show some increase in rate of 
volunteering. Second, for all grades the increase is greater for children in the 
experimental group than for those in the control group. Third, second graders 
show the greatest difference between the experimental and control conditions. 
These results all are consistent with results of the First Experimental Series. 

The analysis of variance in Table 7-4 based on gains in rate of volunteering 
from Phase I to Phase III reveals a significant treatment effect (p < 0.05), no 
significant grade differences, and no significant grade x treatment interaction. 

TABLC 7-4. Analysis of Variance for Gain in Rate of Volunteering. White Middle 



Class Students. (Approximately balanced for sex) 



Source 


d.f. 


Sum of 
Squares 


Mean 

Square 


F*value 


Experimental vs. Control Treatment 
Grade 

Treatment x Grade 

Within Treatment x Grade Groups 


1 
3 
3 

3l0t 


46.81 
22.42 
20.27 
2409.70 


46.81 
7.47 
6.76 
7.77 


6.02* 

0.96 

0.87 


*Btyond the S percent level 

tTwo obiervationt are m«»ing. estimated by tubclast means. 



It is of interest to combine data from third- and fourth-grade white middle 
class subjects of this experiment with the rural and inner city data discussed 
in the First Experimental Series. To do this, the earlier data have also been 
reduced by randomly discarding six black inner city fourth-grade groups and 
two while rural third-grade groups (sec the data summarized in Tabic 7-2 vs. 
data of Table 7-6). Some of the fourlh gradc subjects included in Tables 7-3 
and 7-4 have also been eliminated randomly. 
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For the combined subjects of the First and Second Experimental Series, 
the treatment efTect is again Judged highly significant (p < O.OI). There is no 
evidence here that the cfTect of the treatment is different by grade or residen* 
tial locus, since none of the interactions of treatment with any other factor is 
significant. 

TABIC 7-5. Average Gain in Rate of Volunteering From Phase I to Phase llt« (or 

Children from Three Residential Loci« Grades 3 and 4 (/V*s in parentheses) 



Residential Locus Suburban Inner City Rural 



Grade 


3 


4 


3 


4 


3 


4 


Experimental Group 


I.8S 


2.20 


1.85 


1.90 


2.60 


1.90 




(20) 


(10) 


(20) 


(10) 


(20) 


(10) 


Control Group 


1.20 


2.07 


0.97 


0.20 


0.57 


-0.07 




(60) 


(30) 


(60) 


(30) 


(60) 


(30) 


Diflference Between Groups 


0.6S 


0.13 


0.88 


1.70 


2.03 


1.97 



The combined data show that expectations were raised significantly in 
experimental children compared to control children, and as shown in the 
other analyses (Entwisle and Webster, 1973a), residential locus probably does 
not affect susceptibility to treatment (no T x R interaction). 



TABLE 7 6. Analysis of Variance (or Gam in Rato of Volunteering; Three Residential 
Loci; Two Grades 







Sum of 


Mean 




Source 


d.r. 


Squares 


Square 


/'•value 


Treacment (T) 


1 


110.21 


110.21 


14.75* 


Residential Lucus (R) 


2 


43.62 


21.81 


2.92 


Grade (C) 


1 


0.87 


0.87 


0.12 


r X /! 


2 


22.2 i 


li.lO 


1.49 


r X C 


1 


0.S0 


0.S0 


0.07 


^ X C 


2 


35.77 


17.88 


2.39 


7* X /? X C 


2 


2.27 


1.14 


0.15 


Within r. R, G Subgroups 


347t 


2S93.IS 


7.47 





'Beyond the 1 percent level 

ITvvo obie'veiions a;e musing : eitimeied by lub: eis ream. 



C. DISCUSSION 

The results of the Second Experimental Series extend the positive findings of 
the First, since white middle class children also respond to the experimental 
procedure by increasing their rate of hand raising. Although there are other 
groups that could be studied (for instance black middle class), the range of 
children already included suggests that the phenomenon is replicable and 
fairly ^'cncral. 1 he parallels are clear between the experimental task, comple- 
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tion of a story under the experimenter^s direction, and many tasks overseen 
by teachers in r^igsrooms. Expectation States Theory has» then, at least some 
relevance to elementary education. 

The significance of social status in this study difTers from its significance in 
earlier Expectation States Theory work. Generally, previous status charac- 
teristics research has been concerned vyith the effect of differential status in 
assignment of power and prestige within a small group. E. G Cohen (1968, 
1970), for example, has studied mixed groups of black and white boys 
attempting to solve a problem together, to see who makes more performance 
outputs. In the laboratory, Webster (1970) has studied the effect of status 
characteristics on the effectiveness of evaluators; performances were moni- 
tored by evaluators of high status (college students) or of low status (eighth 
graders). In both E. G. Cohen's and Webster's studies, it has been shown that 
the status characteristic will, under certain circumstances, lead to differential 
conceptions of ability in accord with the different states of the diff*use status 
characteristics. 

The focus of the present work differs. Here we have groups, all of whose 
members arc equal with respect to a status characteristic (such as sex, age, and 
race). The question is whether expectations will be raised to the same extent 
for members of each status group. Can expectations of rural children be 
increased as easily as expectations of inner city children, for instance? The 
intent is thus to look at the ^'demography*' of expectation raising, particularly 
for the kinds of children— rural or inner city— who are often classed as low 
achievers. This experimental treatment for modifying children's expectations 
has worked with children from three very different residential settings and no 
significant differences by residential locus have appeared. The effect has been 
greatest for rural students and the suburban second graders, but not signifi- 
cantly greater (no digniHcant treatment x locus interaction). So far age does 
not appear to be important in expectation raising, although only for one 
group (middle class) has age been sampled over any sizeable range. 

More work is needed, however, to increase confidence that groups are as 
homogeneous as suggested here. For one thing, over the grade school years 
there are rapid shifts in children's interests and capabilities, so that a task 
suitable at one level may be inappropriate, or relatively ineffective, at other 
levels. The present research assumes that the group is collectively oriented and 
seriously motivated towards high performance of the task at hand. Partial 
failure to meet these conditions would attenuate any observable effects of 
expectations, or of our attempted experimental manipulations. 



V. THIRD EXPERIMENTAL SERIES 

The S\k\\s orciMiccrn in the l'ir>( l;\pcrinicntal Scries was task development 
and general feasibility of the research program. The Second Experimental 
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Series focused upon determining the generality of the efTect across societal 
status groups. In the Third Scries we ^\U€ concerned with pursuing some 
suggestions^ and also possibly problems, that grew out of the earlier experi- 
ments. Some instances of possible refractoriness to the expectation-raising 
treatment had occurred and these deserved more study. In addition, alterna- 
tive interpretations of the results could be investigated with further work. 

In studies reported to this point two kinds of children are not very respon- 
sive to the treatment: inner city black children and white middle class girls of 
third and fourth grades. The results for inner city children have already been 
discussed at some length. The lack of response in middle cldj>s girls has not 
been obvious to the reader because it was not possible to include sex as a 
factor in the analysis. The reader will note« however, in Table 7-7 u here the full 
set of data for third- and fourth-grade suburban children is presented and 
tabulated by sex, that boys show sizeable increases and girls actually show 
* decreases. 

7ABL£ 7*7. Average Gain in Rate of Volunteering From Phase I to Phase III for 

White Suburban Children, Third and Fourth Grade. (A/'s in parentheses) 



Girls Boys 
Experimental Control E-C Experimental Control E*C 







Grade 3 








0.36 


0.86 


-0.50 


2.29 


0.71 


1.S8 


(14) 


(42) 


Grade 4 


(14) 


(42) 




1.08 


1.46 


-0.38 


1.86 


0.S2 


1.34 


(12) 


(35) 




(14) 


(42) 





Also* during the experiments the experimenters observed that the white 
middle class girls were particularly w ithdrawn and unresponsive. We therefore 
performed two replications. 



1. In the first replication we relumed, exactly one year later, to the inner 
city schools where we had conducted the first experimental series with 
black children. On the return visit we employed only black female ex- 
perimenlers. Everything else in the replication including time in the school 
year, was the same as in the First Series. 

2. The second replicntion study consisted of experiments with only girls of 
the third and Tourth grades in a white middle class suburban school. The 
school where the replication was carried out was dilTcrent Trom the school 
in the Sccoiul Scries, but was \cry much like the Second Series school in 
social cLiss nnd other characteristics, (f urthcr work could not be done in 
the first .school because all students had already participated.) 
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A. BLACK INNER CITY REPLICATION 

Two general lines of thinking seem to be consistent with the relatively lower 
efficacy of the experimental treatment for inner city black students. The first, 
which we may call the "debilitating eflect or school hypothesis,** holds that an 
important outcome of school for black children is to lower their self- 
confidence and seir-evaluation. We speculate that as black children go through 
the grades, they receive predominantly negative evaluations of performances, 
and consequently become progressively more certain that they will fail at 
anything they attempt. They come, in other words, to hold self*expcctation 
states that are fixed and low. If such cfTects have been building up over several 
years in school, our experimental procedure may be just too weak or too 
short in duration to produce any marked expectation raising with these 
children. The fact that eflTects of the procedure vary inversely with school 
grade for all three status group samples in the First and Second Experimental 
Series is consistent with the idea that expectation states become more difficult 
to modify as children become older. 

An entirely different explanation is suggested by the work of Katz and his 
associates (1968, 1970), who present indications that white adults are per- 
ceived as hostile by some black children, and that when white examiners test 
black children, the children often assume that they are being compared to 
white children. In the former case, our white experimenters may not have been 
accepted as "sources'* by the children; that is, as "significant others," whose 
evaluative opinions arc accepted by them. In the latter case, we would expect 
that children would form low self^expectations because of the inferred com* 
parision to white children who possess higher diffuse status, in the manner 
described theoretically by Berger et al. (1966), and documented with grade 
school children by E. O. Cohen and her associates (1970). 

One way to disringuish between these competing explanations is to repeat 
the earlier research using black experimenters, ir the "debilitating* effect of 
school" is the explanation, then black experimenters should also be ineffective 
in raising expectations of black children, ir racial mismatch was the problem, 
then black experimenters should be effective. 

For this series, black won- :-n, students at a nearby college, served as the E*s. 
As already mentioned, the schools selected for the experiments were the same 
inner city schools as those used the previous year. Students were all black 
fourth graders, the most resistant group in the earlier study, and also the 
group best suited for distinguishing between the competing explanations. Of 
course (hey were not the same students as those who had previously par- 
ticipated. 

Table 7-8 shows tiiC gain scores for children using this sample and the black 
experimenters. Control group children show an average gain of 1.82, and the 
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average gain in the experimental group is 3.37. A Most of ibe difference in 
gains is significant beyond the 5 percent level. 



TABLi 7-8. Avtraga Gaini in Voluntaaring ; Black Si : Black Ea 



Group 


n 


Gain 


Experimental 


S7 


+3.27 


Control 


19 


+1.82 



Black experimenters were thus successful in producing a gain in the rate of 
volunteering of black children. In previous data for inner city blacks with 
white experimenters, gains were not significant (see Table 7-1). 

A number of reasons could account for the discrepant results of the two 
experiments with black inner city children. The most appealing reason is the 
racial difference in experimenters from the first experiment to the second, but 
a much more tightly controlled set of experiments is required to rule out other 
explanations. The experimental procedures, for example, were considerably 
refined by the time of the repeat experiments. Also, the black E's were proba- 
bly more alike and more uniformly skillful than the E's used in the First 
Experimental Series. Nonetheless, significant differential gains are seen in a 
subgroup that previously had not manifested significant gains, and this argues 
once more for the general effectiveness of the experimental procedure. 

Temporarily being less cautious, we note some suggestions based upon 
these results and the ideas mentioned earlier from Katz*s work. This work 
indicates that some E*s may not possess the necessary attributes to serve as 
effective sources. Attributes of a source, besides higher status (white skin 
color and/or age) or higher ability, reflect on that source's effectiveness as a 
purveyor of expectations, no doubt. Perhaps some sources can never achieve 
complete credibility for some subjects even though the status or ability cri- 
teria are met. An example will make this clearer. Suppose two social scientists 
are colleagjcs. and one is highly eminent and nationally known, whereas the 
other has only a local reputation. The eminent colleague is disposed never to 
criticize and always makes favorable comments about the work of his local 
colleagues. In this situation the less widely known social scientist will not 
weigh very heavily any favorable remarks from hi^ eminent colleague because 
he sees very little variation in the tenor of the eminent colleague's remarks; no 
matter what occasions them, they are always positive. The eminent colleague 
does not possess credibility in the .sense we have used the term even though 
his status and other attributes, as well as his positive remarks, meet the surface 
conditions of serving as a source. In the same vein, if black children interact 
mostly with adults (white teachers who elect to work in the ghetto) who»c 
strong tendency may be to remain pleasant and to refrain generally from 
negative c\aluations, such adults may lose credibility as sources for raising 
expectations. 
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The issues here are undoubtedly complex. Studies- by Katz and his associ* 
ates (see Katz, 1970) suggest that a black examinee s perceived probability of 
a successful performance on a test is apparently determined by his beliefs 
about the reference group he is being compared with— if black comparison 
groups are explicitly mentioned, the black examinee does best with a white 
administrator. In our use of white E's with black S's in the First Experi- 
menUl Series, the children might have assumed they were being compared 
with other children (mostly white) whom the white E had dealt with in the past. 
In this case, following the line of reasoning suggested by Katz's work, the 
probability of success perceived by black children may have been low. 

In applied work the characteristics of a source that makes him effiective at 
raising expectations may be a research U .ue of high priority. Teachers, for 
example, of the same ability (educational level) and status are notoriously 
variable in their influence on students. One thing making a teacher effective 
may be ability of the teacher to hold and to convey high expectations about 
students. A teacher's high expectations for a student may be a powerful 
inducement for the student to enter a high expectation state for his own per* 
formance. 



B. SUBURBAN FEMALE REPLICATION 

At the beginning of this section (see Table 7*7) it was noted that third* and 
fourth-grade suburban girls actually decrease their rate of volunteering from 
Phase I to Phase III. When these data are pooled with other suburban data 
and sex is not included as a factor in the design, incrcoses in other groups are 
sufficient to mask this finding. Overall there is a significantly larger gain 
score for experimenial subjects versus control subjects in spite of the negative 
gains observed fur these girls. Since research in on early stage, like theexplora* 
tory studies presented in this chapter, cannot be entirely insensitive to patterns 
in the data, we oblnined permission to run an additloiml fifteen experiments 
(n 60) with third and fourth grade girls only at a suburban school much 
like the one enrolling the suburban children of the Second Experimental Series. 
These results are shown in Tabic 7-9. 



TABU 7'9. Averaga Gains fn Volunitering .- Whiia Suburban 
Ptmate Subjects: Grades 3 and 4 



Group 


A 


Gain 


Experimental 


15 


+ 1.80 


Control 


43 


-f 1.49 



For experimenjal group girls ir this replicating experiment, the mean gain 
in rate of volunteering from Phase I to Phase 111 is 1.80; for control group 
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Sirls it is 1.49. The difTcrence, 0.31, is not significant on /-tesc. Our tentative 
conclusion is, therefore, that the present procedure has not been eflcctive for 
third and fourth grade white suburban girls, aithougli we now believe it is 
probably cfTcctivc for all other groups, if races of experimenters and subjects 
are matched. 

What factors could account for the sex difierence in effectiveness? It is 
thought that boys, more often than girls, are blamed or criticized by teachers 
of elementary school (Brophy and Good, 1969, 1970). Because of this, the 
expectation-raising procedure of Phase II may be considerably more effective 
for boys than for girls. Boys who hear consistent praise during the experiment 
may be contrasting it with a background before the experiment of little posi- 
tive evaluation. Girls, on the other hand, who apparently are seldom subjected 
to blame or criticism, could interpret the expectation raising maneuver of 
Phase II as a continuation of positive evaluations they have been receiving all 
along. Or, since girls generally do well in elementary school compared to 
boys, their expectations for performance at verbal tasks may already be at an 
asymptote so that attempts to raise expectations further are fruitless. Our 
data do not allow a choice between these explanations. 

. C. SPECIAL CONTROL cAPERIMENTS 



In assessing the basic experimental procedure tested in the First Experimental 
Series, we concluded that the Expectation States Theory interpretation of the 
results was more plausible than alternative interpretations. The most reason- 
able alternative interpretation seems to be one that asserts that the experi- 
mental procedure produces increases in hand raising, not because of any 
improvement of the child's expectation state, but rather because of rewarding 
features of the Phase II situation. The alternative interpretation consists of 
one or both of two basic arguments: (I) experimental group children are 
responding to the situational rewards of receiving extra attention from an 
adult during Phase II; (2) experimental group children gain confidence 
through the additional practice of constructing an additional story during 
Phase II. Either or both of these elements can then be combined with implicit 
assumptions about behavior reinforcement to explain the increased hand 
raising observed for experimental group children, without any reference to 
expectation states or any change i-. the child's cognitions about his ability. 

To provide empirical information relevant to the behavioral reinforcement 
inlcrprctution of all these experiments we decided to conduct a set of cxperi- 
mencs for which special controls were devised. In this set of experiments, two 
of the four children in each group were treated separately during Phase II. 
One child made up a story and received positive evaluations, exactly as the 
expcrimcfit;il group child had in all previous experiments. The other child, 
whom wc &hali cull the "special control group" cliild, made up a story alone 
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with an experimenter, but did not receive the positive unit evaluations or 
words. The experimenter for this group was neutral, nonevaluative; the 
child's words were received and written down silently, and the experimenter 
attempted to avoid any verbal or Tacial expressions of either approval or 
disapproval. Experimenters rotated among phases of the experiment so the 
experimenter effects would be balanced. Experimenters were unaware of the 
identity of either the experimental or special control child in Phase III. 

Treatment of the special control group children thus incorporates the 
**special attention" and "practice** features of the experimental group treat- 
ment, but omits the positive unit evaluations that the theory asserts are essen- 
tial in this situation for producing expectation state changes. 

Subjects for this set of experiments were third graders at the white, suburban 
school used for the Second Experimental Series. Table 7-10 presents the mean 
gain scores from the control, special control, and experimental group children. 



TA6L£ 7'10. Avtragt Gains in Volunttering; Special Control 
G:oup Exptriments 



Group 


n 


Gain 


Experimental 


22 


1.36 


Special Control 


22 


0.00 


Control 


44 


-0.18 



^Wf >ipKUt contfou < 0.0S 



The control and special control groups both show essentially zero changes. 
The experimental group shows a gain in hand raising whereas the special con- 
trol group does not (p < 0.0S).* The data support the predictions of Expec- 
tation States Theory and do not support an explanation based on "special 
attention** and ^'practice** arguments. It may also be noted that the special 
control group children did not show a marked drop in rate of voluntMring, 
as would be expected if they had interpreted the experimenter*s behavior as 
hostile, negatively reinforcing, or negatively evaluative. As mentioned, experi- 
menters for this set were carefully trained to be /id/ievaluative in Phase II. We 
conclude, both en the basis of theoretical analysis and empirical evidence, 
that the Expectation States Theory interpretation of the results of our experi- 
ments is most satisfactory. 



> Recent work by Professor Barbara Sobieszek at the University of Rochester has pro- 
duced results consistent with the results of th>s experiment. Subjects receiving differential 
•mounts of action opportunities randomly allocated by the experimenter did not use this 
to form differential expectations for ability, even though this was the only differentiating 
inform". liDn available to them under laboratory conditions. This result, obtained under 
conditions more highly controlled than ours, increases confidence in the results of the 
special control group experiments. VV^ thank Professor Sobieszek for making these results 
available to us prior to publication. 
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VI. EDUCATIONAL IMPLICATIONS OF RESEARCH TO DATE 
AND FUTURE WORK 

To place in perspective the work reported in this chapter, it must be seen for 
what it is intended to be: 9i first step towards applying some of the basic con« 
cepts and assumptions of the propositions of Expectation Suites Theory to 
naturalistic settings. Such attempts involve a large number of operational and 
procedural difficulties. To our knowledge, the only other sustained attempt to 
apply concepts and assumptions from Expectation States Theory for practical 
ends in educational research is represented by the work of E. G. Cohen and 
her colleagues (1968, 1970). These investigators also report difRculties with 
Uisk development and experimental design like those encountered in o**" 
vesearch program. 

Despite some difficulties, it does seem possible, using relatively simple 
techniques, to intervene in a natural situation and to change the expectations 
of children so as to affect their subsequent behaviors in ways that should 
further educatiouiil goals. (Sec also Entwible & Webster 1972, 1973a, 19736.) 
Work of the Cohen group similarly suggests that it may be possible to over- 
come the negative effects of being black in producing differential performance 
expectations in mixed racial groups of grade school children. 

A. REUTION TO TEACHER EXPECTANCY RESEARCH 

Results of teacher expectancy studies have been disputed in some cases (see 
Barber and Silver, 1968) and unequivocally negative in others (Claiborn, 
1969; JacobSp 1969). The range of results of our own experiments points to 
possible sources ofdifficulty in the teacher expectancy work. First, while most 
children show slight increases in performance outputs even without much 
encouragement (the control groups), some children fail to manifest increases 
in performance even with extensive positive evaluations. Earlier we pointed 
out that white experimenters may not be able to provide very effective evalua- 
tions for black children. More recent work.(Entwisle & Webster, 19736) sug- 
gests that racial mismatch between adults and children is complex in its efTect, 
interacting with social class as well. 

Second, Expectation States Theory assumes a task orientation on the part 
of students and our experiment assumes a task where ability is equal or irrele- 
vant: Both assumptions may be violated often in the teacher expectancy work, 
for many school children have little ego involvement in academic pursuits, 
and many alrc;iJy have firm ideas about their own level of ability. To the 
extent that chiklrcn have ncccss to ohjcctisc standards or to alternative others 
for cvalualint: their pcrroriu;inccs, wc would expect that the teachers* expecta- 
tion fur thcin would decrease in importance. 
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A third point, made by Claiborn (1969), is that there may be no chan^ :s in 
teacher-pupil interaction—no classroom analogue of our Phase II treat* 
ment. If teachers perceive pupils to be of high potential, teachers may alter 
their behavior, but teachers vary in how they change their behavior to suit 
childrcn*s ability (Kranz, Weber, and Fishelli 1970). For example, Kranz et 
al, show that some teachers behave similarly towards high- and average- 
ability children but differently towards low«abiIity children. If, as in most 
teacher expectancy experiments, a teacher is given false reports about stu« 
dents* potential, she might or might not change her behaviors toward the 
designated children depending upon how her own behavior pattern is 
expressed. Also, of course, since expectations typically flow from evaluations, 
which in most classrooms are expressed by peers as well as by teachers, 
changing expectations of only the teacher may not be sufficient to produce 
changes in children's self-expectations. Recent work by Cohen and Katz 
(1972) shows that expectations of both white children and black children of a 
work group must be molded if black children are to improve performance. 

Perhaps what is needed most at this point is more fine-grained analysis of 
exactly what behaviors accompany changed expectations. Our own work is 
one approach to this. Other approaches besides those already mentioned 
include Meichcn^ urn ct al.*s observations that expectancy instructions 
(identification of "late bloomers*') cause some teachers to increase positive 
interactions with students, or to decrease negative interactions.' Positive 
interactions included conveying encouragement^ praise, or any attitude of 
satisfaction. Also Brophy and Good (1970) observe that teachers tend to 
praise more those children for whom they hold high expectations, and to 
demand more in the way of performance from them. 

One of the major results of this work is the experimental task itself. It 
offers a means of manipulating (raising) young children's expectations with- 
out the drawbacks that affected ways of manipulating expectations used pre- 
viously. The behavior influenced—frequency of hand raising— is generally 
considered to be important educationally. Hand raising leads to greater 
participation. The importance of active participation to learning is too well 
known to require documentation. In future reports, we hope to study further 
how this procedure affects racial, age. and SES subgroups of the population. 
Further work will extend the basic experimental design to examine the rela- 
tion between expectation change and structural variables like sociometric 
standing. 

Also, further work will study students* expectations for academic perfor- 
mance as a function of their own feedback over time. What happens, for 

2|n other iii>Lincc\ when expcciancy or teacher expcciancy cxperimenis have Tailed to 
achieve rcMills (hat were aniicipaieil (CiolUsmith. 1970: l leming unJ Anitonen, 1970) an 
analysis iif liiutingN m icriiisi^r I n SuxWs Ihcixy may be hclprul. l or example, it 

may be lhat ihc leather lines no\ atlir expeelaney because the slu<lenls do not accept the 
\alidity of the teacher's expectations. 
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example, to a child's high expectations for himself when he gets o low evalua- 
tion (bad report card) from a teacher? If a child has low expectations for 
himself, and his parents have high expectations for him, how does his class- 
room behavior evolve? 

What is the potential of this kind of research for educational purposes? 
There is value first of all, in making explicit parallels between social psycho- 
logical research and research in classrooms. As has been so frequently noted, 
there is a surprising gap between these two fields of research that hinders both. 
By linking classroom research to a body of pre-existing theory, one gains 
coherence and explanatory leverage on a whole body of research findings. 

The experimental maneuver reported here may not be potent, by way of 
long-term educational effects, however. The present procedure, as earlier 
pointed out, does have the distinct advantage of avoiding the use of false test 
scores or of any form of outright deception. It also has the advantage of 
working directly on a child's expectations rather than on the expectations of 
some other person for the child. It is difficult, however, to imagine using a 
maneuver like the one described here over and over to obtain broad effects in 
raising children's expectations. This procedure may nevertheless have useful- 
ness over the short term in two ways: 

1. The procedure may act as a pump primer. If a child's expectations for 
himself are suddenly raised, as in the experiment, he niay alter his actions 
for a short time in ways consistent with his increased expectations for 
himself. Any intervention wiihln the circular scries of events->improved 
self-expectations leading to better performance leading to improved 
teacher evaluations—may be effective. The short-term change in a child's 
actions induced by an expectation raising procedure might thus fire a 
chain reaction that would tend to continue once started. 

2. This procedure and others that could be invented may give precise sug- 
gestions as to how teachers can convey positive expectations. Some teach- 
ers may wish to convey positive expectations but not be very adept at 
it. The suggestion here is that praise for one activity (word giving) leads 
to an increased level of another activity (hand raising) at a later date. In 
classrooms where many activities occur, then, the teacher need not wait, 
perhaps in vain, for a praiseworthy performance in arithmetic to improve 
a child's expectations for himself in the area of arithmetic. If the child 
can be encouraged for one sort of performance this may generalize to 
yield increased participation across the board. In fact, some situations 
may be used primarily to improve expectations rather than for learning 
perse. 

The results of the Special Control Group experiments indicate quite clearly 
that it is the unit evaluations that determine children's expectations, or at 
least that by comparison with other features of the situations, such as being 
given action opportunities and special attention, the unit evaluations are far 
more Important. An importont extension from this is that one means that 
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mieht be thotiglit tu raise childrcirs expectations -calling upon them more 
often— probably is inefrcctivc. This conclusion may be counterintuitive in 
some cases, Tor it might be thought that if the teacher calls upon certain chil* 
dren more frequently this would indicate to them that she thinks highly of 
their answers. 

However, as was shown in the Special Control Group experiment, this 
"special attention" or differential allocation of action opportunities is proba- 
bly ineffective in changing expectations. Thus, in order to apply the results of 
our experiments to raising children's expectations, it is imporUnt to bear in 
mind that calling on the selected children more frequently will not by itself 
improve their expectations greatly. What is probably required is to praise 
their responses, to give them extensive positive evaluations of their perfor- 
mances. 

To surpmarize results at this stage of our research, we review the three 
general goals adopted at the outset. First, we hoped to apply a formal theory 
to analyze interaction in classrooms, particularly what has been called the 
"teacher expectancy efTect," and to show that some of the previous results in 
this area could be incorporated into the scope of Expectation States Theory. 
Second, we hoped to develop techniques of intervention that would induce 
specific changes in classroom interaction to improve children's learning. 
Third, we nvcdcd to develop a task— the story telling game— that could be 
used easily and cRcctively. The overall results of our experimenu indicate 
some success in meeting these goals. 

More specifically, the experimental results showed increases in expectations 
for all groups except white, middle class female third and fourth graders, and, 
initially, black third and fourth graders. In the case of the black children, an 
additional set of experiments suggested that the earlier failure to produce 
results may have stemmed from racial mismatch between experimenters 
and children (white experimenter, black children). Preliminary experiments 
suggest that another kind of racial mismatch (black experimenters, white 
children) will not vitiate the procedure— black experimenters may turn out to 
be more generally effective across all kinds of subgroups of children than 
white experimenters Research with black experimenters and white children 
and with racially mixed (black-white) groups is now commencing. We are 
also observing teachers in classrooms to get information about how teachers 
"naturally" convey expectations, and about how other students convey expec- 
tations. The effect of the treatment was greatest for rural children, perhaps 
because they attend schools where few breaks in the routine occur. The schools 
are in remote areas, and so are seldom included in research studies or special 
programs. Oih-r schools where this research was carried out are almost con- 
tinuaiiy involved in activities initiated by persons not on the regular staff. Thus 
:he urban and suburban children may be more "sophisticated" in terms of 
serving as research subjects and less susceptible to any attempted treatment. 

Further work in this program may be organized into three general cate- 
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gorlcs. First. \vc arc working to develop additional tasks that may be used in 
the research. It seems desirable to develop tasks unrelated to academic activity 
for example, athletic tasks and social or leisure-time tasks such as hobbies' 
Second, we hope to be able to study factors that govern how successful the 
intervention is. For example, sex of the experimenters may have a dilTerential 
effect, and probably operates in rather complex ways involving cross-sex and 
same-sex combmations. Factors of a diflerent kind such as the cemrality of 
sociometric ranking of the children and their average level of performance in 
school may also be related to expectation raising. Third, we hope to design 
studies based upon other aspects of the theory. One such study springs from 
Assumption 3. which asserts that action opportunities and evaluations will 
bt distributed in accordance with expectations held for others. In some cases, 
perhaps when children are interacting together in the absence of an adult the 
assignment of an action opportunity may be equivalent to the voicing of a 
positive evaluation. 



Phsss I 



Phssi II 



Phase III 



Control 
5-s 



ona story is pro- 
duced (12 words); 
notvaluations; 
level 0? volunteer- 
ing observed 



control S's have 
story «aad to them 
by another adult, 
from 12 to 16 
control S's Join in 
one group 



rtpeet Phese I, 
with same control 
^s and experi- 
mentel Su as in 
Phase I; experi- 
mentere ere rote- 
ted so the experi- 
menter ieunewere 
of identity of ex- 
perimentel 



Experi- 
mental 
S's 



one story is pro- 
duced(12words); 
no eveluations; 
level of volunteer- 
ing observed 



experimentel S's 
make up etory 
individuelly with 
the seme experi- 
menter they have 
seen in Phase I; 
rirceive ell positive 
evaluations; ex- 
perimental S's 
join control S's at 
end of this phase 



repeet Phase I. 

with seme control 
S's end experi- 
mentel S's as in 
Phase I; experi- 
menters ere rote- 
ted so the experi- 
menteris uneware 
of identity of ex- 
perimental S's 



FictRK 7-2 

Summary orcxpciiiiicnUil procedure 



ERIC 



C - 3J 



APPENDIX A 

Formal Statement of Propositions 
Definition 1 

A situation is tasbsituation 5 if and only if it conuins: 

a. at least two actors, p and o. making performance outputs; 

b. an actor, r, making unit evaluations of those performance outputs; 

c. no previous expfctations held by p and o of their own or Mch other's 
abilities at the task; 

d. task orientation of all actors; 

e. collective orientation of all actors. 



Definition 2 

e is a source for p in task-situation 5 if and only if p believes that c to more 
capable than p of evaluating performances. 



Assumption 1 

Iji task-situation 5. if f is a source for p, then p will agree with #*s unit evalua- 
tions of any actor's performances. 



Assumption 2 

In task-situation 5, if p evaluates a scries of performances of any actor, then 
he will come to hold an expectation state for that actor which is consistent 
with those evaluations. 

Assumption 3 

In task-situation 5. if p holds higher expectations for any actor 0% than for 
another actor Oj*. 

a. p will be more likely to give o, action opportunities than Oi ; 

b. p will be more likely to evaluate positively 0|'s future performance out- 
puts than Oj's; 

c. in case of disagreement between o, and Oj, p will be more likely to agree 
with 0| ; 

d. p will be more likely to accept 0t than 0a as a source. 
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Assumption 4 

In usk-situalion S, the higher the expeciations an actor, p. holds for self 
relative to the expeclalions he holds for o : 

a. the more likely is he to accept a given action opportunity and make a 
perrormance output; 

b. in case of disagreement with o, the more likely is he to reject influence. 
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Abstract 



This research studies hew the status characteristics of adults 
and children affect adults' ability to raise a child's eacpectatlons 
of his ovm perfoxnance at school-like tasks. White adults are effective 
at raising expectations of white children or black children In mixed 
racial work groups; black adults are effective with black childroi but 
apparently not with white children in mixed groups. These results, 
both consistent and Inconsistent with previous findings for homogeneous 
groups, afe interpreted in light of the children's relative position 
in SES with respect to members of their own race. Unlike most research 
related to the effects of desegregation, this research examines both 
black children's and white children's reactions to black adults. 



EXPECTATIONS IN MIXED RACIAL GROUPS^ 



One way to conceptualize educational research Is by the unit of 
analysis. Traditionally educational psychologists have analyzed 
Individuals , for example, hew a person's perceptual skills affect his 
reading achlev^ent. Sociologists of education, on the other hand, 
have analyzed social aggregates , addressing such Issues as the school 
perfoznance of subcultural or minority groups. Both lines of inquiry 
look at outputs, achievements of the IndividucuL or of the social group. 
Both tend to skip over how inputs are converted into outputs of achieve- 
ment, in particular what Intezpersonal events, as the child interacts 
with his family group or with his peer group, foster achievement. If 
middle class children read better than lower class children, exactly 
what happens day by day in ^ middle class setting that brings about 
superior reading? Vfhat social processes or activities occur to account 
for the differences? 

This paper tries to trace out sequences or patterns of social 
interaction that lead children to make outputs. The unit of analysis 
is the school child nested in a group with peers and one adult. This 
unit bridges both the psychological and sociological units of eumlysls 
mentioned above, and focuses on the articulation of the child with his 
group of significant others. The aim is to trace out how social feustors 
and processes are translated into esgpectations of children and how 
these expectations, in turn, affect children's performance. 

In the experiments reported here, adults atteiapt to alter children's 
performance expectations for themselves. In all cases adults attesqpt to 
manipulate upwards a child's performance expectations. One aim of the 
research is to point out what an adult can do to raise a child's 
expectations. Another aim is to see how status charEu:teristlcs of adults 
and of children affect expectatlon-iaising. Is a black adult, for 
example, more or less effective than a white adult in raising a black 
child's expectations? Does a child voliuiteer more often in a mixed race 
group than in a group where everyone ' s race matches his own? 

The program to raise children's perfoznance expectations developed 
frcKQ several sources, among them a theory describing the detezminants 
and consequences of performance expectation stat es in small task-oriented 
groups (see Berger, et al., 1972). According to this theory, e:g)ectation 
states— which are roughly equivalent to beliefs about ability— are formed 
for actors as the result of evaluations made of their per'^ormances. 
Once formed, expectation states tend to persist, and they affect both 
the ways in which actors treat each other, and the types of actions they 
initiate. Some of the interaction consequences of expectation states, 
particularly self-confidence and willingness to engage in interaction, 
seem very closely related to problems encountered in educational situations. 

The laboratory, however, is remote from the classroom. Producing 
practical results in a naturalistic setting leads to problems somewhat 



different from those faced in a more highly controlled social psychological 
laboratory. What characteristics, for exao(ple, must be possessed by 
an adult In order for this adult to function as eui effective raiser of 
children's eaipectatlons? In general we estpect that an effective expecta- 
tlon-ralser should possess the characteristics of a "significant other" 
In Sullivan's (19^7) language: he should be warm, trustworthy, etnd have 
the potential for affective ties to the children; he should also be 
perceived as highly cosipetent to evaluate (Sobleszek emd Webster, 1973) • 
There may glLso be particular historical considerations which operate 
In educational settings, for example, the effects of racial mix of 
children In the school. 

Self-esteem or the self-image in relation to educational performance 
has been repeatedly Investigated (Brookover, Thomas, & Peterson, I96U; 
Wylle, 1963) but aside from consistent small positive correlations be- 
tween self-image and school achievement, little so far has emerged to 
suggest how self-esteem Is acquired or how It leads to varying behavior. 
Perhaps only when the Individual Is enmeshed In his network of significant 
others can one study what raises or lowers self-esteem. This research 
focuses on socleJ. events or processes that lead to small changes In 
children's behavior or self -view, and looks at what specific behavior 
by whom brings about those Cheuiges. We attempt to see how adults Inter- 
acting with children can raise the child's expectations, and how social 
feustors affect the process. 

TMs research^ besides linking laboratory research to practiced 
concerns, cJ.so relates closely to two other current Issues in educational 
research. 

There is, first, the issue of how segregation or integration actually 
affects children's perfonnance in school. Intra-classroom data on effects 
of integration for grade-school children are scarece. Little is known 
about the social-psychological factors which establish racial identity, 
or about exactly how social processes in mixed racial groups affect 
achievement. It is often assumed that blacks, because they are discriminated 
against and are of low status in the larger society, coae to have low 
e^gpectatlons for themselves. This set of low es^ectatlons Is thought 
to lead to a generalized low opinion of the self (low self-esteem) as the 
low expectations are repeatedly confirmed. Recent findings (Beers, 
1973; Rosenberg & Simmons, 1971; Soares & Soares, I969) of relatively 
high self-esteem for black children, in some cases exceeding levels for 
whites, run counter to these suppositions emd point up the complexity 
of Integration effects. 

Expectations, and expectation raising or lowering, are shaped in a 
social context. Wliile adult blacks may be discriminated against, children 
who live in eai-black neighborhoods emd who attend segregated schools 
may be largely unaweure of "outside" events. The paradoxical findings of 
high self-e3g;>ectatlons for blacks may be partly explained by blacks' 
relative insulation, as suggested by Rosenberg and others. Data here 
provide evidence in support of an insulation effect for, as will be seen, 
black adults are uniformly effective at raising expectations of black 
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children while white adults are not. A child dependent upon other 
people like himself for evaluation may be in a cocoon where the larger 
society is almost irrelevant. 

Of equal importance, performance expectations have both "self" and 
"other" components: a child's expectations for himself are defined only 
with regard to a specific other individual. From our perspective, it 
makes no sense to speak of "general low self-expectations" supposedly 
held by blacks. If black children have low expectations for their own 
performance, it is only with respect to specific other children , such 
as white children. A black child may have averag? or high expectations 
for his own performance when he is in a gro\]p of other black children, 
and then display all the behavior patterns of & child with low self- 
expectatlons when he is in a group containing white children. Our 
research is grounded in a theory which explicitly emphasizes the importance 
of social context— separation from society as a whole, and racial 
composition of the Immediate group— in forming the child's view of 
himself. 

Second, the issue of teacher expectancies and the self-fulfilling 
prophecy in educationaJ. settings has led to a shower of studies and to 
some controversy. Contradictory and equivocal findings have been frequent. 
The experiments reported here trace how teachers' expectations, the 
presumed "active agent" in the Pygmalion studies, may be conveyed to 
students. Expectations teachers hold for their students may be potent 
determinants of students' actual perfoxmances because teachers translate 
their e^qpectations into responses that affect the child's own expectations 
for himself . Although there are many studies in the teacher expectancy 
area, none but ours as far as we know, tries to link the child's 
expectations into the process. The present experiments focus directly 
upon the child's expectations, which may be the most inqsorteuit part of 
the "Pygmalion effect." If a teacher expects more from a child, he gives 
evidence of his high expectations for the child. The child, besides 
producing more in response to the teacher's demand, probably changes his 
view of himself so that it conforms with the teacher's view of him. In 
other words, something cognitive and something persistent occurs In the 
child when the teacher gives evidence of high expectations. 

A child builds performance esgpectations for himself on the basis of 
responses supplied him by significant others. These expectations then 
persist as part of his ability self-concept. We are investigating how 
children actually form expectations about their own ability to do school 
work, and when these expectations shift as a consequence of actions taken 
by others. What set of persons — parents, peers, teachers, others- 
constitute the set of significant others for this process? Putting the 
child in the loop, as the present research tries to do, but which other 
teacher expectancy studies fail to do, allcwF a number of related issues — 
the effect of praise, reinforcement propert Icq of adults — to be integrated 
into a consistent conceptual framework. Fivrther discussion of these 
relationships will follow presentation of the expexlments. 
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Previous Work 



In earlier studies (Entwlsle &> Webster, 1972, 1973, 197U) procedures 
were devised for raising children's eaqaectations for their own performance, 
and for lna)rovlng their willingness to participate in a naturalistic 
classroom task. An adult woiks with one child selected from a four-child 
group and follows a prssriK^ed pattern to raise that chil-^'s e:?pectation8 
(see below). ETqperlmont" far, with one exception, have been homogeneous 
with respect to race; th..w is, groups of black children or white children 
met with adult experiment jrs who were of the same race as n:he children. 
White children from grades one through four and from both rural and sub- 
urban backgrounds generally responded significantly to the expectation- 
raising treatment administered by a white adult. KLack adults with black 
inner-city children were also effective. White adults with black inner- 
city children were the single exception in terms of racial matching and 
this combination was relatively ineffective. 

Research reported in this paper extends the previous work by studying 
several kinds of racial mixing. Some experiments focus on black adults 
working with white suburban and white rural children's groups. Other 
experiments deal with mixed racial children's groups, where both white 
and black adults work with groups where two children are white and two 
children were black. After reviewing the e35)erimental procedure, we will 
present separately results for these two new lines of research, (l) black 
adults with white children's groups, and (ii) black or white adults with 
Integrated children's groups. 



The Procedure 

Children in second, third, and fourth grades who attended schools in 
the greater Baltimore area served as subjects. Middle-class young adult 
women, some black and some white, served as experimenters. At the outset 
the children met together and groups containing four children, all of the 
same sex and grade, were formed and designated as "Team 1," "Team 2," etc. 
Children were told that their team was about "to play a game making up 
stories," and that the teams would be competing for a prize: everyone on 
the winning team would get a prize. Insofar as possible, members of a 
team come from different classrooms. The teams then went to separate rooms, 
with one experimenter accompanying each team. 

The experiment consists of three phases. (See Entwlsle & Webster, 
1972 for a more detailed description of the experimental procedure. ) 
In Phase I each team supplies 12 words to fill in a story skeleton. The 
storj- skeleton consists of Incomplete sentences: "Once upon a time there 

was a ." Children are Instructed to raise their hands if they can 

think of a word(s) to complete the sentence. The experimenter cautions 
the children not to raise hands to volunteer a word unless "you are sure 
you have a very good word— one that will help the team score." Given this 
emphasis on task performance, hand raising is taken as an operational 
measure of each child's level of performance expectations for himself. 
The experimenter notes unobtrusively how often each child volunteers 
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(raises his hand), then chooses one child to give the "team's word," 
being careful to choose each child equally often as a respondent. The 
experimenter is businesslike and does not praise or otherwise reinforce 
or evaluate responses in Phase I, merely noting them down. 

In Phase II one child (whose rate of volunteering in Phase I is 
near the median for the group), is selected for the expectation-raising 
treatment. In Phase II the selected child makes up a story by himself 
using a new story skeleton. Every word he gives, as well as the overall 
story, receives strong praise and positive evaluation from the experi- 
menter — a treatment which our emedysis indicates ( see Entwisle % Webster, 
1972, 1973^ should raise his performance expectations, and later, hi a 
rate of group peurticipation. Note that the treatment to raise the 
child's e:q>ectations involves praise of responses, not of hand-raising; 
when a child is completing a story by himself he does not, of course, 
need to volunteer. 

In Phase II the three children not selected frcmi each team for 
the expectation- raising treatment leave the room where Phase I is carried 
out. These untreated children of each team constitute a control group, 
and spend Phase II in the centred room listening to a story being read, 
(other ei^erlments (Entwisle & Webster, 197^^) have established that 
this story-listening control treatment is equivalent in its effect to 
a control treatment in which an individual control child fills a story 
skeleton with an e^erimenter bat the e:Q>erimenter remains neutral. 
This equivalence points to the raising of e^gpec tat ions, as a consequence 
of positive feedback from the experimenter, as the crucial part of the 
Phase II treatment rather than to such things as telling a good story, 
isolation with an adult, or other incidental accompaniments of the Phase II 
treatment. ) 

Phase III is a repetition of Phase I. The original teams reassemble 
in the experimental rooms and use a third story skeleton to produce a 
new story. The experimenter again remains neutred, calling upon each 
child an equal number of times, and notes how many times eeu:h child 
raises his hand. Experimenters are rotated between Phases II and III 
so that during Phase I IT they do not know the identity of the child who 
received the expectation- raising treatment in Phase II. 



Results 



The analysis focuses upon changes in the rate of hand raising. 
The question at issue is whether the child who received the Phase II 
treatment raises his hand more in Phase III than in Phase I. His change 
between Phase I and III is conpeu'ed to chemges between Phase I and III 
for a selected child who received the control treatment in Phase II. 

A word is needed about the choice of an individual control child 
whose gain is used as the baseline against which to measure gain by the 



e3g)erljQental child from the same group. For this work a stringent 
selection procedure, which entails considerable exclusion of data, 
was adopted for several reasons. First, group climate is probably 
Important in its impact on volunteering, so children frcm the same group 
are matched in the analysis. Second, close Phase I matching of e3Q>eri- 
mental and control children was undertaken to guard against biasing of 
results in favor of predictions. Biasing can occur in three ways: 

(a) through the mean score for three control children tending to exceed 
the average score of experimental group children in Phase I, because 
although children occasionally volunteer 11 or 12 responses in Phase I, 
it rarely happens that a child volunteers once or not at all; (b) 
through ceiling effects when scores in Phase I are close to the maximum 
possible score (12); or (c) through regression effects, where disparate 
Phase I scores might lead to regression effects either through the 
experimental child's score regressing upward if his score in Phase I 

is relatively low, or through the control child's score regressing 
downward if his score in Phase I is relatively high. 

For these reasons, the following rigorous procedure for screening 
e^eriments was adopted. First, a particular control child's score, 
rather then the mean score of three control children in Phase I, was 
taken as \;he baseline. The two child z^n are participants in the same 
experimivnt. Second, only experiments where e:cperimental and control 
children are closely matched in Phase I are included for analysis. 
The criteria for close matching are (a) unless the experimenal child's 
Phase I score and the control child's Phase I score can be matched within 
two units or less, the experiment is discarded (regression effects ); 

(b) if the Phase I score for the experimental child or the child selected 
to be his control is 11 or 12, the etqperiment is discarded (ceiling 
effects) . 



Experiments with Black Adults and White Children's Groups (Suburban and Rural) 

One set of experiments involved black adults with white rural third- 
graders. The black adults were middle^class black women students at a 
private university. The children lived in a farming area about 30 miles 
north cf Baltimore City and attended an all-white school. The mean gain 
in volunteering between Phase I and Phase III for treated children over 
gain for control children in 1? experiments is 1.94, significant well 
beyond the % level. 

Another set of experiments involved the same black adults with white 
suburban third- graders. These children attended an all-white school 
located Just over the city line from Baltimore in a lower middle class 
area. The mean gain in volunteering for treate; children over gain for 
control children in 28 e:Q)eriments is 1.66, also signifLcanb beyond the 
% level. 

When data for black adults and white suburban or white xvlt&I children 
are combined with earlier data (Entwif' j & Webster, 1972, 1973, 197U) 
a 3 X ? factorial design can be manufactured. This design is comprised 
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of third graders dravm from three residential loci treated by ej^erlmenters 
of two races, black or white. Table 1 sumoarizes the mean gain of 
treated children compared to control children for the various combinations 
of experimenters and children. 

The reader should be warned that there are procedural differences 
among the esgperiments. The experiments summarized in Table 1 were 
carried out over a 3-yeeu> period. Both male and female experimenters 
were used in some of the early work, and experiments were aggregated in 
the earlier euialyses. Nevertheless except for experiments with bleusk 
inner-city tliird-graders and white experimenters,3 all combinations produce 
fairly sizeable gains in expectations of treated children conpeu>ed to 
children given a control treatment. In all these experlmants gx-oups 
of children were homogeneous with respect to race, so major interest 
attaches to the effects of matching or mixing of race between adults 
and children. As men. loned, all combinations are effective except the 
combination of white adults with black inner-city children. 



Experiments with Black Adults, White Adults> and Racially Mixed Groups 
of Children 

In another set of experiments, racially mixed groups of children 
peurticipated in a procedure identical to that described above. Each 
group contained two white and two bleusk children. These racially mixed 
children's groups met with white e^erimenters or black experimenters. 
As will be e3q)lalned, a white child or a black child was selected to receive 
the expectation-raising treatment, and the remaining three children, two 
of one race and one of the other, received the control treatment. 

The child chosen for the expectation-raising treatment in Phase II 
either matched or differed from the experimenter in terms of reuse. This 
gives four possible types of e:Q>eriments (a 2 X 2 factorieJ. design based 
on race-of-experimenter and race-of-child) with black e:gperimenter euid 
a black treated child, blrick experimenter and a whitct treated child, 
white experimenter and a black treated child, and white es^erimenter and 
a white treated child. 

An integrated school with about 60^ bleusk and ^K)^ white children in 
a lower-class Baltimore neighborhood provided subjects. Carrying out the 
expariments led to exhaustive sampling of white children enrolled in 
grades 2, 3> and k and to an 80^ sample of black children in those grades. 

In advance it was difficult to predict what would happen with 
racially-mixed groups of children. Perhaps in mixed groups expectations 
of black children would be difficult to raise. As far as race of adults 
is concerned, black adults had proved effective with white rural and 
white suburban children (see above) but white adults had been ineffective 
as expectation raisers for black inner-city children. The same criteria 
for screening experiments mentioned above led to selection of 29 experi- 
ments with mixed racial groups. Ceiling and regression effects should 
be minimal for these 29 experiments. 
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The gains In volunteering for treated children coiqpared to gains 
for control children in 29 esgperiments are presented in Table 2. For 
example, in lU experiments where the experimenter was white and the 
treated child was black, the treated child on the average gained two 
more units between Phases I and III than the control child. Other mean 
gains listed in the first row of Table 1 show analogous comparisons for 
other combinations of experimenter's race and child's race. One-sided 
t-tests appropriate for matched groups show significant differences 
favoring treated children for three e:g?erimenter- subject combinations 
(white E with white S's, white .E with black S's, and black E with black 
S's), but not for black E and white S's. Variances associated with 
these means are relatively homogeneous. 

Further examination of the data for the black E-white S groups, 
where there is not a significant gain, was undertaken. If an alternate 
set of criteria for selecting experiments is used, a few more experi- 
ments can be included in this condition to further check the conclusion. 
Accordingly, if experiments are selected where matching of control and 
experimental children is not within 2 units but where ceiling effects 
are guarded against and where instead the control child and experimental 
child are matched for race (11 groups), the mean gain score drops to 
neeu: zero (.09). These results increase our confidence in the "no 
difference" conclusion for this condition. These same alternative 
criteria for selecting experiments when used for the white E-blewsk S 
condition lead to confirmation at a higher level of confidence of the 
previously established finding of a gain for experimental children. 

The matched- reuse experiments (white E-whlte S and black E- black S) 
show statistically significant gains, even though the number of experi- 
ments is small. The findings for matched- race, combinations observed here 
are consistent with previous findings for matched- race E's and S's 
where members of teams were homogeneous with respect to reuse (see Table 1). 
Further e:q)eriments could not be run because the subject pool was used up. 

Table 3 shows the mean initial (Phase I) rank in group of escperimenteJ. 
and control children. In all groups, white and black children volunteer 
about the same amount in Phase I, with black children ^volunteering only 
slight].y more theui white. This table suggests that black and white 
children's performances are initieOly much the same. 



Discussion 



Status Difference and Status Distance 

Results from the present experiments, together with earlier 
experiments reviewed above, make a complicated picture. To review the 
findings: (1) when the race of the e:gperimenter matched the race of the 
treated child and children's groups consisted of a single race, significant 
increases in the expectations of the treated child occured. Black experi- 
menters have been uniformly effective in raising the expectations of 
black children, and white experimenters have been uniformly effective 
with white children. (2) In the one case previously reported (Entwisle & 
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Webster, 197^) where race of experimenter differed from race of children, 
white adults were ineffective with all-black ixmer-clty children's groups. 
The findings of the present experiments shofw (3) signifioeuit increases 
for black adults with all-white groups of rural and suburban children, 
(4> but no significant effect for black adults Interacting with white 
children drawn from mixed racial groups even thv^ugh {3^ white adults are 
effective with black children drawn from racially mixed groups. Effects 
of racial mismatch are apparently complex. Two possible es^lanatlons are 
suggested, one based on reuslal mismatch and the other on social class. 

Mixing of races between experimenters and children may mEike children 
less likely to see the adult as a credible source. The black child 
encouraged by the white experimenter may feel he is being ing^licitly 
compared with other white children even though the children surrounding 
him are all black (finding 2 above) or a white child encouraged by a 
black experimenter may feel he faces "stiff competition" when he is in 
an Integrated group led by a black experimenter (finding k above). But 
this explanation is not entirely satisfactory; it is hard to see why this 
same explanation would not apply equally well to the white child in an 
all-white group encouraged by a white experimenter (finding 3 above) or 
to the black child encouraged by a white experimenter, who cooipetes 
against white children in an Integrated group (finding 3 above). 

Supplementary evidence also weakens the racial-mixing e^gslanatlon. 
In the experiments where children's groups were racially mixed an analysis 
of varleuice performed on rates of volunteering in Phase I with reu:e-of- 
experimenter emd race-of-chlldren taken as two fixed-effect factors, 
shows no slgniflceuit main effects or interaction effects. Children of 
both races volunteered about the same amount whether the experimenter 
was white or bleusk. In Phase I white children volunteered somewhat more 
for black E's (7>71) than for white E's (6.71), although not significantly 
more in 28 and 26 experiments respectively. This suggests that initially 
at least, white children respond at about the same level to both types 
of experimenters, and, if anything, are somewhat more responsive to black 
adul^;s. This is confirmed in Table 3> 

Social class or SES level is a major point of difference between 
groups of children in this and earlier studies. Altogether four residential 
loci are Involved: rural, suburban segregated white areas, segregated 
inner-city black areas, etnd an Integrated area located at the black-white 
inter feu:e in Baltimore. Blacks in the Integrated area are considerably 
higher up the SES ladder than blacks in c(xnpletely segregated ghetto 
neighborhoods in east euid west Baltimore where the studies with all-black 
children's groups were carried out. On the other hand, whites living 
in the integrated area in Baltimore City where the racially-mixed groups 
were secured are much below the. SES level of the rural or suburban 
whites, being at the low extreme of white SES in Baltimore. 

This suggests that children of very low SES relative to others of 
their own racial group— whites from Integrated neighborhoods or inner-city 
blacks living in the ghetto— may perceive adults of the opposite race as 
hostile. Opposite- race adults, if they are perceived as hostile, would 
not be accepted as expectation- raisers by such children. 
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Precisely why low SES children of both races should be most likely to 
perceive hostility from otiier-race adults is not clear, although Itetz 
(1960, 1970) reports that white adults are seen as hostile by black childrei 
Prom the present experiment it may be possible to refine Katz' idea as 
follows: "Children of very low status in any group are apt to perceive 
adults as hostile if adults belong to a different racial group from the 
child." Our complex results can be interpreted in a single framework if 
this explanation is used. 

Social class or residential locus is a second parameter in addition to 
race which leads to social distance between e:q?erimenters and children. 
Perhaps to achieve credibility as an expectation-raiser the source must be 
far enough removed to be perceived as con^petent to judge, but not so far 
removed that he recedes as a significant other. 

These e:Q>eriments, carried out over a long time span, include some 
differences among experimenters that may affect the findings. The black 
experimenters who were effective with all- black inner-city children's 
groups were lower middle class, considerably lower in SES than the white 
experimenters who worked with similar groups at the same time. There was, 
in addition to matching on race, less social class distance between 
experimenter and children in the case of black E's and black ghetto 
children than for white E's (upper middle class). The black eacperimenters 
who were ineffective with white children in mixed groups, on the other 
hand, happened to be upper middle class blacks, so that again race and 
social class were strong points of difference between adults and children. 



Racial Matching 

A number of studies outside the teacher expectancy fifeld examine 
interactions between race of children and race of experimenter when an 
adult reinforces (praises) or criticizes the child in simple performance 
tasks (see e.g. Allen, Dubanoski, & Stevenson, 1966). In most of these 
studies social class is not studied and sometimes the child's task is 
ambiguous. Nevertheless in general a same- race effect is found: children 
perform better, faster, or more effectively when the adult administering 
reinforcement is of the same race as their own. In our experiments to 
date (which involve hundreds of children in both segregated and integrated 
settings) we have found no exceptions to the effectiveness of racial 
matching over a wide range of children's socicO. class and age levels. In 
every experiment when adults and children are of the same race the adult 
is able to raise children's expectations. Interestingly, some recognition 
of possible advantages of racial matching in college tutoring programs 
occurs in practice. In a survey of h6 colleges having peer- tutoring 
programs, most programs with a large percentage of ethnic minority students 
have tutors from the same ethnic group (Reed, 1973). 



['chool IntcKration 

It is tempting to speculate about the relevance of the experiments 
reported here to problems and tactics of school integration. Naturally 
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such speculation should be guarded since the results here, although 
statistically significant, are based on relatively small, eurid "erhaps 
unrepresentative samples. Particular schools in the Baltioore area may 
not be good barometers of social climate in other schools in the same 
area, let alone elsewhere. Also the .xperlmental situation is organized 
around a single activity ( story- telling) , a single behavior (volunteering), 
and a isingle cognitive cheuige in the children (expectation- raising), 
whereas teachers in classrooms work with larger groups on many topics where 
cognitive changes involve learning rather than only expectation- raising. 
Nevertheless since little is known about how school integration affects 
school performance, particularly for young childi^n, soms extrapolation 
may be fruitful. 

To amplify a point suggested earlier, a conception of childrens' 
self-image which does not include reference to a particular social situation 
is misleading and over- simplified. A child holds performance expectations 
for himself relative to some other individual or individuals, performing 
a known task, if a black child holds low self -expectations, it is because 
he has been unable to perfom well (received negative evaluations), at a 
particular task, or because he is in a group of others whom he has reason 
to suppose can perform better theui he. Experiences of past success and 
failure by comparison with others in a group determine expectation states; 
if the relative amount of positive evaluation given a child changes, or 
if he moves from one group to another, his relative self- expectations 
and expectation-related behaviors will change accordingly. Recognizing 
the relative nature of expectations, as well as of the evaluation process 
by which they are built up, should permit intervention of several sorts 
to combat social sources of unrealistic low self -expectations. 

It Is often speculated that children of different races in integrated 
schools are likely to regard each other with suspicion or hostility. One 
effect of such hostility might be to depress voluntering rates in the 
experiments reported here, especially rates of black children. From another 
point of view. If children perceive race as a relevant status characteristic, 
they will form expectations for each other based upon their respective 
races (see Berger it al., 1972). The effect of this process also would 
be to depress expectations black children hold for themselves— especially 
in Phase I before the experimental treatment intervenes— and thus, their 
volunteering rates. 

The fact that black children do not have lower volunteering ranks in 
Phase I than whites in any of the experimental conditions suggests that 
neither of these processes occurred. There is no evidence that racial 
tensions between children depressed blacks' volunteering in Phase I, so 
children in this study may not perceive race as a relevant status character- 
istic in forming performance expectations for themselves and each other. 
Other data (presently unpublished) for older children in this same school 
reveal that both white and black children have many f ireds in school of the 
opposite race (even though they do not have cross-race friendships in their 
neighborhoods) and have no preferences in terms of race as far as "next 
year's teacher" is concerned. Surprisingly, one class of fifth-graders 
in this school was unaware that blacks constitute a numerical minority 
group until this information v;as told them by the teacher as she taught a 
social studies unit in intergroup relations. More research would be needed 



to investigate these suggestions systematically, bub the fact that this 
school has remained successfully integrated for several years perhaps 
has contributed to break "ng down some "interaction disabilities" based 
upon race. Whether the same climate would prevail in a newly- integrated 
school is not known. 

Related to this point, Cohen and Roper (1973) tried to increase 
performance expectations of black children participating in biracial work 
teams. Junior high aged children of both rtces from segregated back- 
grounds were brought together for experiments, and apparently all 
children formed ?.ow expectations for blacks and relatively high expectations 
for whites on the basis of the race status characteristic. To overcome 
effects of race in order to raise blacks' performance outputs, the 
expectations of white children for black children had to be changed 
as well a3 expectations of black children for themselves. Extraordinary 
efforts were necessary to accomplish this, documenting the strength and 
the persistence of racial bases for performance expectations. Black 
children in the present experiments came from an integrated background 
and apparently neither they nor their white classmates held low expectations 
for blacks' performance. 

History of previous contact, particularly equal-status contact 
between schoolchildren across races, and age differences, may best explain 
the difference among findings. Elementary school black children like 
those in the present study may have higher self -concepts than older 
children who come from segregated schools or who have been compared 
unfavorably to whites. The same general effects would also help explain 
the complex interactions found between race of adult, race of child, and 
child's social class level in this series of experiments and previous 
ei^periments of the same type. 

Most previous research on performance in mixed racial groups has 
involved college students. There is very little research on mixed racial 
groups involving children of elementary school age, or on desegregation 
effects for younger children (see Cohen, Pettigrew, & Riley, 1972). The 
work reported here may have some relevance to desegregation strategies in 
two respects. 

Kirst in a school which has been desegregated for longer than any of 
its present students have been in it, these data show an important finding 
by the absence of evidence that performance expectations are based upon 
race. One goal of school desegregation is to make race non- relevant as 
a sufficient basis for forming performance expectations. There is no 
evidence in our data that these children formed expectations for one an- 
other on the basis of race. By comparison with other results with children 
from segregated schools, these data are encouraging. Comparing these 
results of ours to results Cohen and Roper (1973) report may well indicate 
that sustained integration of an elementary school can break down the use 
of race as a basis for performance expectations. This constitutes what 
is to our knowledge one of the very few quantitative indicators of positive 
effects of desegregation. 
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Second, discussions of desegregation effects concern mainly reactions 
of blacks in schools where some classmates and teachers are white (see, 
e.g., Katz, 1968). Issues involving white children's reactions to black 
teachers remain virtually unexplored. In racially mixed classes, for 
example} will a black teacher or a white teacher be more effective? Are 
there beneficial effects in terms of positive intergroup relations from 
having first- and second-graders in white segregated neighborhoods taught 
by black rather than white teachers? Or what are the effects upon inner- 
city black children of white middle-class teachers? Compared to the 
efforts necessary to bus children between schools, it would be relatively 
easy to "bus" teachers. This aspect of school integration, the deployment 
of the black or white teacher, although easily manipulable, has been little 
investigated. Itoat studies so far concern "racial balance" where racial 
balance has to do only- with the racial mix of students. The present 
research begins to broaden research along the teacher dimension and suggests 
what factors— such as status distance and SES matching*- interact to affect 
teacher effectiveness as a significant evaluating other. 



Teacher Expectancy 

The research reported here is also closely related to research in 
the teacher-expectarcy field but differs in two crucial respects: it is 
directly concerned with the child's expectations and it attempts to 
specify what actions taken by adults affect children's es^ctatlons. 
In other work in this field, much of vdiich is equivocal, teachers' 
expectations are manipulated euid then the subsequent behavior of children, 
usually performance on standardized tests, is observed. As we have 
pointed out in detail elsewhere (Entvrisle & Webster, 1973b) there are 
many ways to explain previous findings: teachers may not respond to 
the expectation-raising maneuver becaucie they do not believe it or do 
not remember it; teachers may not change their behavior in ways the 
student perceives as manifesting positive e^cpec tat ions; students even 
if made the target of the teacher's positive expectations may not respond. 

A number of Investigators have begur to look at the find-grained 
behavior of teachers in relation to studeits to try to pinpoint what 
actually occurs in student- teacher interaction in the classrooa (see 
e.g., Brophy & Good, 1970). These studies, with one exception, do not 
examine race or social class. However, Rubovits and Maehr (1973) studied 
66 white female teachers- in- training in interaction with Junior high school 
students with the students in U-person groups (2 black children and 2 white 
children) . Random members of each racial pair were labelled as "gifted" 
and "non-gifted." It turned out that teachers called on white students 
more and praised them more than black students. There was, in addition, 
a startling "Race X Label" interaction. Although all blacks were given 
less attention and less praise tlian whites the gifted blacks were given 
the least attention and praise, even less than the non-gifted blacks. 

The Rubovits and Maehr study examined "teachers" (actually persons 
like our experimenters, female college students) in free interaction with 
racially-mixed students' groups where the adults could allocate action 
opportunities and then respond to children's actions. The Rubovits and 
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Maehr study cannot be directly compared with our eacperiments for several 
reasons, one being that children in a "small midtfestern city" are 
probably not coiqparable to our S's in terms of social class. For an- 
other, our experiments have a tightly prescribed protocol for adults' 
actions, especially in that the number and type of action opportunities 
allocated cannot vary across race of the child. On the other hand, 
their observations do suggest why white adults may not be credible 
sources for bleu:k children in some cases, for bleu:k children may have 
had a previous history of relatively unfair treatment by white adults 
like that seen in the Rubovits study. 

We are presently conducting further work on children's expectations 
to see (1) whether expectations generalize from one task to another and 
(2^ what kinds of suctions taken by children may affect expectations of 
other children. 
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^ae closest of possible matches was taken when more than one control 
child met the 2.unit separation criterion. If Uo control-group children 
are tied in terms of closeness to the experimental child, the control 
child whose race matches the experimental child's is selected. 

3 

Black inner-city fourth-graders and white ejsperlmenters shoved aa even 
smaller gain, an average increment of only O.U for 16 experiments. 
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Mean Gain Between niase I and Ihase ZII> Experimental 
Children over Control Children 
(number of experiments In parentheses) 



Children's Residential Loci 



White 

Experimenters 



Suburban White 
1.58* 



Ihner-City Black 

0.97 
(80) 



Rural White 

2.03 
(20) 



Black 

Experimenters 



1.'66 
(28) 



(19) 



1.9^ 
(17) 



* For boys only. The treatment was Ineffective for girls. See Entwisle 
and Webster, 1973b, for details. 
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* p < .05 
** p < .01 



Table 2 



Gains In Volunteering for Mixed Racial Groups 
(nuo^er of experiments in parentheses) 



White Experimenter 

. White Treated Black Treated 
Child Child 



Black Experimenter 

White Treated Black Treated 
ChUd Child 



Mean Gain 



k.l6 
(3) 



2.00 



1.00 
(8) 



1.75 



t- values 



2.93* 



2.53** 



0.97 



1.70* 



o 

ERIC 



1*5 



D - 22 



Table 3 



Volunteering Ranks for Mixed Racial Groups 
(rank 1 high; rank k low) 



White Experimenter 



White Treated 
Child 



Black Treated 
Child 



Black Experimenter 



White Treated 
Child 



Black Tl>eated 
Child 



Average Initial 
Rank in Group 

Exper. Child 

Control Child 



3.00 
2.63 



2.69 
2.i»0 



3.25 
3.00 



2.62 
2.62 



1 
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RAISING EXFECT/VTIQNS INDIRECTLY 



Doris R. Entwlsle 
Murray Webster » Jr. 
The Johns Hopkins University 



Abstract 



Previous e^cperinents have demonstrated that It Is possible to raise 
children's esQpectatlons for tasks like those children perform in classrooms." 
Experiments reported here shew that children will "generalize" expectatiofis 
from one task to a second dissimilar task. If children's performance at * 
a task involving planning a meal are positively evaluated, they subsequent- 
ly demonstrate that they have raised their expectations for their own 
performance at a story-telling task. Results are consistent with 
previously reported data in that the generalization effect, while signi- 
ficant, is smaller in magnitude than the direct effect where stoxy- telling 
is used both for positive evaluations and measuring expectations. 
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RAISnO EXPECTATIONS INDIRECTLY' 



The classroom presents a continuous series of unfamiliar tasks to 
each child. At particular times, for Instance, children must learn 
grapheme-phoneme correspondences, cursive handwriting, the multiplication 
tables, how to read a map, or to ask to get a drink of water. Consider- 
able evidence now exists that how well a child expects to do at each new 
task affects his performance, his self-confidence, his Interaction 
patterns with teachers, his Interaction with other students, and often 
his level of learning (see, for example, Brophy & Good, 1970; Entwlsle 
& Webster, 197^b; Rosenthal & Jacobson, 1968). 

For some time we have been engaged In a series of experiments 
designed to analyze determinants and Interaction consequences of children's 
expectations for their own performances. C^e aim of this reseeu-ch Is to 
produce simple techniques to allow Intervention In the chain of social 
psychological events Involved. We believe that children develop 
relatively enduring conceptions of their own likely success at specific 
tasks as the direct result of positive or negative evaluations received 
from the teacher and from other students (Entwlsle & Webster, 1972a, 1972b, 
1972c). Further, one can Intervene in a classroom- like setting to 
produce high self-estpectations by giving selected children positive 
evaluations of their past performances at problems, and these Induced 
high self-esgpectations will be reflected in increased frequency of hand- 
raising in response to group- directed questions (Entwlsle & Webster, 
197'^b). Finally, we have developed a simple procedure for expectation 
raising which has been successful with children of vaxylng demographic 
characteristics (age, sex, and race, in urban, suburban, and rural 
residential locations (Entwlsle & Webster, 1972d, 1973, 197^^). 

The basic esgperimental task consists of three phases in which groups 
of four children work to make up stories. The eseperlmenter reads an 
Inconqplete sentence, and where words are missing, pauses to let children 
think up a "good" (appropriate and interesting) word. Each child who 
thinks he has a good word raises his hand, and one of them is called on 
to supply the "team's word" for that sentence. Raising the hand in 
response to group- directed questions is taken as the operational measure 
of each child's level of es^ectatlons for success at this task. 

Phase t3 I and III are measurement phases. In which children's expec- 
tations are recorded. During phase II, an attempt is made to raise one 
child's expectations by having him complete a story entirely by himself, 
filling in all the needed words when the experimenter gives the sentence 
cues.' (Other children, who form a control group, meet elsewhere with 
another experimenter.) The cniclal aspect of the experimental treatment 
is that each response the child gives is positively evaluated. In the 
measurement phases, no evaluations are given. By comparing the phase I 
to phase III gain scores on hand- raising frequency for experimental 
group children (those receiving the phase il treatment) to scores for 
matched control group children (who do not receive any evaluations of 
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performances), one can determine if the experimental treatment led to 
increased rates of hand-raising. 

E]<periinents have so far involved about 1000 children, and have been 
widely effective across sociological subgroups. Tiiey show it is possible 
to raise expectations through controlling performance evaluations, and 
these inqproved expectations are reflected in a behavior widely regarded 
as an important determinant of actual learning, willingness to engage in 
problem solving attempts in the classroom. But whether one can raise 
expectations for a new task has not so far been tested. The research 
to be reported here involves such a test. 

Recent theoretical work (Berger & Fisek, 197^; Kervin, 1973) asserts 
tnat In the absence of information regarding competence at a specific 
task, individuals will form expectations from knowledge about some other 
task. In other words, if an individual needs to decide how likely he is 
to perform well at some new task and if there is no available information 
regarding his performance at task A, he will use knowledge of his ability 
at task B and predict that his ability at task A is the same as that at 
task B. Laboratory experiments with college students support this claim. 
In the classroom, then, if a child has no idea how well be can do some 
new task such as arithmetic, but knows he is good at spelling, he will 
decide as a "best bet" that he can also do well at arithmetic. ^ 

Whether the process works as simply as Just described, particularly 
whether esgpectations will generalize across very different tasks and 
whether grade school children in a classroom will behave like college 
students In a laboratory, remain undemonstrated. We therefore undertook 
a set of ejcperiments to shed light on the expectation generalization 
process with grade- school children. 

The experiments reported here were designed to resemble as closely 
as possible previous experiments in order to permit direct comparison of 
results across experiments. As mentioned, the experiments consist of 
three phases. In phase I a group of four children are assembled and the 
story-telling task is described to them. Then they build a story by 
volunteering 12 wordF, exactly as In previous experiments. In phase II, 
one child is selected to receive the expectation-raising treatment at 
the unrelated task, and the other three children go to another room and 
have a story read to them. Instead of making up his own story as in 
previous research, the experimental group child works on an entirely 
different task, one called "meal-planning." The meal-planning task is 
presented to the child as "choosing foods which are nutritious and go 
together well for a holiday dinner. "3 A large bulletin board containing 
colored pictures of various kinds of food is placed before the child and 
he selects food pictures one at a time and places them together on 
another bulletin board nearby. The evaluation procedure is comparable 
to that of previous experiments: the experimenter, after every food 
choice, tells the child he has made a god and nutritious choice, and 
the resulting dinner after all choices have been assembled gets a very 
high overall evaluation. Then the original groups are reassembled for 
phase III, escperimenters are rotated between groups to prevent their 
knowing which child received the expectation-raising treatment, and the 
i3tory-telling task is repeated. Nothing is said about possible relevance 
of the two tasks, meal-planning and story- telling. 



Subjects for these experiments were third and fourth graders from 
two schools enrolling only white children In rural Maryland, and one 
school In Baltimore City enrolling only black children. Children of 
both sexes participated In the experiments, though each four-person 
experimental group was composed of only one sex. Experimenters were 
white women for the rural children and black women for the black children. 

If expectations generalize -in this context, one would predict two 
results. First, by comparison with the untreated control group children, 
the experimental group children should show greater phase I to phase III 
gains in es^ectations, measured by rate of volunteering. Second, because 
tasks differ between phases expectation-raising effects should be weaker 
than effects previously observed when story-telling was the task in all 
three phases. 

Table 1 presents the average gain scores for experimental and control 
groups in both settings. Escperimental group children (white) in rural 
MarylEUid show a mean gain of l.Uk between phase I and phase III; control 
group children show a comparable gain of .33. The difference or "expec- 
tation advantage" for escperimental group children is +1.11, and is sig- 
nificant at about the .03 level using a one-sided t-test for matched 
pairs. ^ Similarly experimental group children (black) in Baltimore show 
a mean gain of 1.10 between phase I and phase III; control group children 
show a negative gain of -Q.ky. The difference 1.55 is significant at 
about the .02 level. 

Table 2 presents coiQ>arable data for previous expectation experiments 
when evaluations of story- telling were used to raise expectations. As may 
be seen, although both procedures produce statistically significant results, 
the effect of indirect expectation-raising is not as large as the effect 
of direct manipulation in previous experiments. In previous experiments 
(Entwisle & Webster, 1973) the mean gain for third grade white rural 
children was 2.60; for thiitl grade black inner «lty children it was I.85. 

The gain scores in Table 1 show that it is possible to raise 
children's expectations for the story-telling task by giving them positive 
evaluations for an unrelated task, planning a meal. The expectation 
generalization process apparently occurs for young childz>en in settings 
resembling those of a classroom. The results are consistent with results 
in more highly controlled laboratozy settings with college-age subjects. 

In practical terms these experiments show that an indirect expecta- 
tion-raising procedure may operate in ordinary classrooms where it is not 
possible to control evaluations directly. A teacher need not wait, perhaps 
in vain, for a praiseworthy perfonnance in arithmetic: giving a child 
heavy positive evaluation in some other area probably affects his general 
expectations, and thus his willingness to try arithmetic problems. 

The two tasks used for these experiments were dissimilar, and this 
suggests that two tasks need not be perceived as relevant in order for 
generalization to occur. In laboratory experiments on expectation general- 
ization the two tasks were also dissimilar: matching English and non- 
English word meanings in one case, and estimating the area of colored 
figures in the second. By contrast, learning experiments on stimulus 
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generalization use closely related stiimili, such as a ^ps and 700 cps 
tone. The expectation generalization Process assumes a cognitive linking 
of two stimuli (ability beliefs) which are not at all intrinsically 



similar. 



in further e^cperiments we hope to test further the applicability of 
the generalization phenomenon across sociological subgroups of the 
population, and in conditions of e^cplicit relevance of the two tasks. 
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This research was supported by Office of Education grant OEG- 3- 71-0122. 
We thank Frank Tondrlck and Charlotte Harper, principals of the schools 
Involved In the experiments, and students and staff In these schools for 
their help with the research. 



The theory says that generalization will occur In all cases except where 
the Individual knows for sure that the tasks require unrelated skills. 
In other words, the burden of proof Is upon a claim that abilities are 
unrelated, and It is not necessary for the Individual to know for sure 
that the tasks are related. 

^e thank Lynne Roberts of the University of Washington, Seattle, for her 
assistance in developing the meal-planning task. 



A word about computational procedures is in order. In these experiments 
there are possible sources of spurious successful results; for instance 
through increasing excitement of experimental group children, or through 
"regression to the mean" from choosing either a low phase I interactor 
for the es^erimental group, or high phase I interactors for the control 
group. To avoid these and other problems, we adopt strict criteria for 
inclusion of data. The e^erimental child in each group is cooqpared to 
the one other child closest to him in terms of phase I rate of volunteering; 
further, if no other child is within 2 units of the es^erlmental child, 
data from that group are not included for analysis. This procedure controls 
for "excitment" effects (by coiqparing only phase I to phase III gain 
scores), controls for "regression" effects (b/ requiring phase I compara- 
bility between experimental and control group children), and substantially 
reduces the N upon which statistical tests are based (by using only one 
of the three control group children from every group). Use of these 
criteria resulted in exclusion of 2 of 20 groups of children. 
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Table 1 

Mean Gains in Volunteering for Experimental and Control Children 

Generalization Experiments 



White, rural Ss 

^ 

Blacky urban Ss 



*♦ 



Experimental 
+1.1U 
+1.10 



S.E. of 
Control Difference 



+0.33 
-0.1*5 



.611* 



One- Sided 
t- value 

2.0U (p < .03) 
2.52 (p < .02) 



♦ 18 pairs 
20 pairs 



Table 2 



Mean Gains in Volunteering for Experimental and Control Children; 

Direct Manipulation Experiment 

Experimental Control 

White, rural Ss +2.60 +0.57 

Black, urban Ss +I.85 +0.97 

* N = 20 
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EXPECTATION EFFECTS CN PERFORMANCE EVALUATIONS 
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Abstract 



Situations In which esipectations affect performance evaluations 
of actors are described and analyzed in terms of esQtectations states 
theory. Results of three experiments (N = 289) are presented testing 
derivations of that theory, and some implications of the results for 
theory building and for practical applications are discussed. 



EXPECTATION EFFECTS OK PERFORMANCE EVALUATIONS 



Evaluations of performances, and by extension, of the actors making 
those performances, are iiqportant in a wide variety of social situations. 
Of particular interest to social psychologists are cases in which eval- 
uations are influenced by factors other than objective criteria. A 
variety of processes— "halo effect" (Symonds, 1925), "cognitive set" 
(Kelman, 1961), group influence (Asch, 1956; Schachter, 1951) > status 
effects (Caudill, 1958} Sherif et al., 1955), and others— have been pro- 
posed to account for discrepancies between objectively recorded perform- 
ances and subjective evaluations of performance. 

Distortion of evaluations in the direction of previous beliefs 
about abilities of the performing actors has been widely documented 
(see, for exan^les. Bales, 1970; Hist, 197O} Kelman, 1961). If one 
actor is thought to be good at a task, his performances are more likely 
to be highly evaluated than equal performances by an eustor thought to 
have low task cosipetence. One way to account for this type of cognitive 
distortion has been proposed by Berger, et al, (1972). They argue that 
problem solving interaction leads to formation of more or less enduring 
rbility conceptions, called expectation states , for the actors involved. 

ce expectation states are formed and attached to actors, they are pre- 
dicted to affect most important features of behavior: the higher the 
expectations held for a given actor, the more likely he is to be given 
and to accept chances to perform, to receive agreement and positive 
evaluations for any of his performances, and to exert influence over 
other group members. 

Although this theory can account for many of the evaluation distor- 
tions previously mentioned, and In fact is claimed to apply to an even 
wider range of phenomena, direct tests and applications of the theory are 
still few. At present all direct tests have been conducted in laboratory 
settings, and every one of these tests has taken as its dependent variable 
rejection of influence in case of disagreement, only one of the many 
behaviors predicted to vary with expectation states. Obis limitation 
makeB it hard to assess either the scope of the theory or its usefulness. 

Knowledge of another's expectation state, or of a way to change it, 
would have enormous value in day-to-day living. Accordingly, the eacperi- 
ments reported here use a naturalistic setting and test hov previous 
expectations affect future evaluations of performance. They also extend 
our previous work on the determinants and consequences of children's 
expectations for their own performance and how to change them (see 
Entwisle & Webster, 1972, 197U, for summary). 

Expectation states theory forms a congenial context for analyeing 
interaction, since the schools may be seen as an arena where children 
perform and are evaluated. Teachers ask questions in class, give tests, 
and make homework assignments; students respond and their performance 
attempts are evaluated by the teacher and by other students. Moreover, 
evaluations of schoolchildren and their performances cannot be wholly 
based on objective criteria. Vor one thing, objective evaluative standards 



are often lacking or ambiguous. What is a "good performance" In an 
esthetic Judgment task? What, for instance, differentiates a good poem 
or a good painting from a bad one? Second, correct answers may be hard 
to recognize for difficult or conqalex tasks. Evaluating a single line 
of a copaputer program illustrates this point. In elementary school, 
peer evaluations, though very inportluit to students, are voiced by peers 
who may not be capable of recognizing good or bad performances. Third, 
teacherfi often must evaluate students whom they have not had sufficient 
opportunity, or perhaps any opportunity, to test adequately. A mark in 
arithmetic may be based on inadequately sampled classroom behavior. 
Peer evaluations likewise are frequently based upon inadequate or biased 
test imformation. Fourth, often teachers are required to evaluate kinds 
of performance that neither they nor students can define. First grade 
teachers, for exainple, often give marks in "Language" or in "Spelling" 
when neither they nor their first graders can say exactly what "Language" 
or "Spelling" involves in first grade. Finally, eajpectations for some 
students are so firmly fixed on prior "evidence" that performance not in 
accord with e^gpectations is disbelieved (a phenomenon also documented by 
Whyte, 19U3, in the famous bowling incident of Street Comer Society , 
and lately for first-grade children by Seaver, 1973) . Each of the above 
circumstances tends to confuse and attenuate evaluation solely by objective 
standards. To the extent that objective criteria are not used, expectations 
held for individuals making the performances will influence evaluations. 

If objective evaluative criteria fail to specify evaluations exactly, 
then three possible situations, differing in degree of social definition 
provided by expectation Information, are possible (see Berger et al. , 1972): 
(1) In a maximally defined situation, performance expectations for the 
specific task at hand have been previously assigned to actors; for exas^Ie, 
an actor may be known to possess high ability at task A, which are Just 
the skills needed for successful task completion. Then it is a single 
step to decide that any specific performance of that actor is probably a 
good one and should be highly evaluated. One expects a Metropolitan Qpera 
singer to do well on a given aria even though one has never heard him sing 
it before. (2) In a less completely defined situation, ability is required 
at task A, but no expectations for performance of the actor at task A are 
known. However expectations are held for his performance at task B, and 
task B is known to be relevant to task A. The theory predicts that expec- 
tations will then be formed for task A which are the same as those already 
held for task B. If a person excells in snow skiing, we ejcpect he will be 
good at water skiing. (3) In a minimally defined situation, ability at 
task A is required, ability at task B is known, but nothing is known about 
whether tasks A and B are related. In this case, so long as tasks are not 
explicitly dissociated (as they would be, for instance, by telling an actor 
that the skills are completely unrelated), the prediction is that individ- 
uals will conclude that thf tasks aro probably related. A research 
assistant who is good at computer progrEumning will be expected to be good 
at planning laboratory experiments if nothing else is known about him. 
This process yields the same outcome as types (1) and (2) situations: the 
tasks are assumed to be relevant, expectations are assigned to actors, and 
these expectations affect the likelihood that actors will receive positive 
evaluations of perforra/inces. 

One interesting consequenct- is that all three situations are predicted 
to yield identical outcomes for both expectation states and subsequent 
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behaviors of Individuals. That is, so long as certain structural 
conditions are met, it does not matter whether the process is entered at 
stage (3) where only ability at task B is known, at stage (2) where in 
addition task B is known to be relevant to task A, or at stage (l) where 
ability at task A is also known. This consequence may be accurate but 
the only available data for direct test (Berger et al., 1972) do not 
provide particularly strong confirmation. Tkiree e3a)eri&ental studies 
of the same assertions are provided here for social situations conparable 
to those of Berger et al. Ihe concrete settings, the e^erlmental design, 
and the measure of esgpectation states differ from those used previously, 
however. 

E3Q)erimental conditions were designed to reflect the varying degrees 
of situational definition described above. Condition 1, which we call 
assignment , was maximally defined: subjects were told the ability of the 
other at certain tasks, this ability was made relevant to the particular 
task, and finally, expectations for the particular task were assigned to 
the other person. Condition 2, relevance , was less completely defined: 
subjects were told other's ability at emother task and the ability was 
said to be relevant to the particular task, but no assignment of es^ecta- 
tlons to other was made. Condition 3> activation , was minimally defined; 
subjects were told only other's ability at another task, but no mention 
was made of possible relevance between tasks. 

Experiments were conducted in the spring and replicated In the fall 
with different children in third grade classrooms of two suburban schools.^ 
The investigator addressed the entire class, euid told the children he was 
interested in finding out how well they could tell good words from bad 
words in sentences—the operational measure of "unit evaluation of a 
performance," which is predicted to vary directly with esqpectations held 
for other. (Good words were described as those which fit in well with the 
rest of the sentence, and which are exciting.) Then he handed each child 
a sheet with 10 sentences (Chart I). Each sentence contained some words 
("performances") supposedly supplied by a (fictitious) other student. The 
fictitious students were described as having either high ability at school- 
work (Task B), or low ability. Instructions for the three conditions 
varied as follows. 



[all conditions] We have some words given by boys and girls 
with very high ability at most school subjects, and some by boys 
and girls with low ability. Students with high ability do 
better at reading, spelling, and arithmetic than students with 
low ability. 

[relevance and assignment only] Students with high ability at 
schoolwork also usually give better words than students with 
low ability. 

[assignment only] Today we have some words from students who 
have given good words in the past, and some from students who 
have given poor words in the past. 

[all conditions I However wc have not yet graded the words you 
are about to tsec. 



As mentioned, the set of experiments was run twice. !^o classrooms 
of subjects were used for each condition on both occasions, making six 
classrooms per ewriment. or twelve classrooms all told (the average of 
persons per classroom = 24.1). iJie sentences shotrn in Chart I were used 
in every classroom. In the first classroom of each condition, every odd 
numbered sentence was described as coming from a poor student. In the 
second classroom, descriptions of the fictitious authors were reversed: 
the even numberc^'^ sentences were supposedly given by good students. 

Subjects graded each sentence by checking a box from the following: 

very good good fair poor very poor . 

Data were tallied by arbitrarily assigning "very good" a score of 1, and 
"very poor" a score of 3* Tables 1 and 2 present mean scores given to 
sentences, according to expectation condition. 

It is evident from results in Tables 1 and 2 that in all three 
conditions expectations held for the author had marked effects on 
evaluations of his performances. In the majority of cases, when a given 
sentence was attributed to an author for whom high expectations were 
held, it was evaluated more highly than when the same sentence was 
supposedly produced by an author for whom low expectations were held. 
In Eigperlment I, this predicted effect of expectations on evaluations was 
observed 9 out of 10 times in the maximally defined (assignment) condition; 
6 out of 10 times in the intermediate (relevance) condition; and 10 out of 
10 times in the minimally defined (activation) condition. In Bxperiment II, 
the conparable proportions are 10/10 in assignment, 9/10 in relevance, and 
10/10 in activation. 

To assess the assertion that all three treatment conditions will 
produce equal effects on expectations, as well as to aid in further 
assessing the reliability of differences between expectation conditions, 
we present the results of an analysis of variance in T&ble 3. Fixed-effect 
factors are Treatment (high or low expectations for author). Condition 
(assignment, relevance, activation), and T x C interaction. Treatment and 
Condition both produce significant effects (p < .05), but the T x C 
interaction does not approach significance. 



Discussion 



These results show that, in three different conditions, expectations 
held for an Individual affected evaluations of his performances. Although 
this effect is not surprising because It has frequently been pointed to 
in other settings, our work here apparently constitutes the first direct 
test of the assertion in an experiment excluding most other possible 
sources of variance. Results also indicate the considerable Impact of 
peer expectations on evuluations given to other students. Peer evaluations 
are Important factors In the classroom, so it is useful to confirm that 
the general relation between expectations and evaluations holds in this 
specific case. Both the laughter which greets certain students' answers. 
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and the awe which is accorded to others, seem Interpretable in terms of 
expectation states theory. 

Expectation states theory asserts that individuals will cognitively 
make the connections necessary to structure the incomplete "relevance" 
and "activation" conditions, and that as a result, these conditions will 
produce escpectations identical to those given in the "assignment" 
condition. In a naturalistic setting such as the one we used, many 
factors which are uncontrolled can produce minor differences between 
conditions like ours, which all show statistically significant expectation 
effects. The most plausible alternative to the equality predicted by the 
theory is that conditions would be ordered by magnitude of expectation 
effect as follows: activation < relevance < assignment. There is no 
evidence in either set of e:q?eriments for this ordering from the analysis 
of variance shotm in Table 3. Both Conditions (assignment, relevance, 
activation) and Treatments (high or low esgpectations) produce significant 
differences, but the Condition difference is the effect of classroom 
differences, not differences in information given to subjects. In order 
to establish any ordering of esgpectation effect, we would require a 
significant T x C interaction, and that clearly did not occur. In general, 
data in Tables 1 and 2 indicate that students in the two classro(»ns 
selected for the "relevance" treatment judged the words to be a little 
better than did the students in the other classrooms. The mean scores 
for all sentences in both expectation treatments were: assignment, 2. 77; 
relevance, 2.U3; activation, 2.69. This transformation of mean scores 
is what produced the significant main effect, not the experimental 
mEinlpulatlon. 

Either for theory development or practical analytic tasks, developing 
this new measure of expectations (evaluations of others' performances) 
should increase the flexibility accorded researchers. The measure used 
here is easy to liiiplement and has straightforward relations with many 
cases of practical interest. Often it is not practical or possible to 
make use of disagreement resolution as a measure; as Just one example, in 
many naturalistic situations there are nox-ms which discourage the appearance 
or the recognition of disagreements themselves. In addition, the evalua- 
tions measured may be used in cases where the individual is not in direct 
interaction (such as disagreement) with the others for whom differential 
expectations are held. 

Finally, we wish to note that, although our resiats show a tendency 
for individuals to base evaluations partially on data other than objective 
criteria, the tendency Is not always harmful. To say that evaluations 
"should be" Independent of expectation states seems as useless as saying 
that people "should be" unconcerned with status In their interactions. 
In many situations (such as those described earlier) individuals must 
reach evaluative conclusions when they simply do not have access to 
coioplete information from objective sources. In these situations, 
expectations held for actors — which result in most cases from observing 
previous performances of these actors—may well constitute the most 
adequate available basis for structuring an underdeflned situation. What 
is rightly objected to Is over- reliance on e:qpectations in situations 
where objective criteria for evaluation are readily available. 



Footnotes 



^Experiments reported here were supported by Office of Education Grant 
No. 3-71-0122. We also acknowledge the research assistance of Muriel 
Berkeley, Beth Zaro, Michael Swafford, and Linda Olson with this work. 

^e thank Dr. Evart Cornell and Mr. George Fanshaw, Principals, and the 
staff and students of Loch Raven and Stoneleigh Elementary Schools. 
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Chart I 



Sentences Used for the Evaluation Task 

1. There once was a very tall prince. 

2. There once was a handsome movie star. 

3. In order to fool the ladles, the man drescied up as a piece of 
furniture. 

U. In order to fool the robbers, the princess dressed up as a big black 
bear. 

3. The oceem was l\ill of sharks and whales. 

6. The parking lot was full of people with dogs. 

7. When the Indians found the cowboy, they sat down and said hi.' 

8. When the teacher found the book, she opened It and began reading. 

9. This was a good thing to do on Sunday. 
10. This was more than he wanted. 
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Ttible 3 

ANOVA of Overall Mean Scores 



Source of 

Variation df Mean Square F p 

Treatment 2 1.16 3.30 < .05 

Condition 1 12.0? 3^^.39 < .05 



Replication 1 0.08 

T X C 2 0.20 

Within T X C X R 113 0.351 



n.s. 
n.s. 
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EFFECT OF A PRINCIPALS EXPEaATIONS ON TEST 
PERFORMANCE OF ELEMENTARY SCHOOL CHILDREN* 

DORIS R. ENTWISLE,* BVART CORNELL,* JOYCE EPSTEIN* • 
Johns Hopkins UnivmiSy 

Summ0fy.—h 17<lassroom experiment shows th«c t (mtle) priocipal's 
commciiM convi7ing positive expectations to hoys on t first test tre followed by 
improved performance on a second test. No effect of comments was apparent for 
girls. The experiment is diKussed in the context of current research on teacher 
expecuncy. 



Expectations of teachers for students and how these expectations affect per- 
formance have been the topic of much recent research (Brophy & Good, 1970; 
Claiborn, 1969; Fleming & Anconnen, 1970; Meichenbaum, Dowers, & Ross, 
1969; Rosenthal & Jacobson, 1968 ) and also of some controversy. In most of the 
studies where teachers' expectations are presumed to influence pupils, no actual 
measurements of teachers' behavior or attitudes are made. One exception is 
work by Brophy and Good (1970) who in studying dyadic contacts between 
teachers and students observe that teachers demand better performance from 
those students for whom they hold high exj^ctations. Another exception is 
the study of Rothbart, Dalfcn, and Barrett ( 1971 ) who in a simulated classroom 
situation show that teachers are more attentive to students for whom they hold 
high cxjiccrhtions. More work is needed on liow school |K-rsonncl convey their 
expectations and how this shajKs students' behavior. The present re]X)rt of a 
school- wide exiHrriinent is an attempt to provide such information on how 
expectancies are mciiiated. 

A study of teacher comments and student |wformancc by Page (1958) 
suggested that on one objective test teachers' comments had a positive effect on 
a student's ranking on a subsequent objective test. In other words, those stu- 
dents receiving comments on a first test ranked higher than other students re- 
ceiving no comment when a second test was given. Written conuncats can be 
considered as one mediator of expectations* 

Because of the current interest in expectancy and performance, we decided 
to pattern an experiment after Page's but to use comments conveying only expec- 
tations (see below) and to explore the range of "significant others." Our 
major interest lay in whether comments by the principal on arithmetic test papers 

'This nsi'arch was supportid by Office of Education, OEG Grant No. OEG-3-71-0122. 
The opinions cxprisscil in this puhliiation do not necessarily represent the position or 
policy of the Ofiicc of Education, and no official endorsement by the Office of Education 
should be infirriii. 

'Department of S(>cial Kclarions, Johns Hopkins University. 

'Principal, Stoncliigh Eknuntary School. Baltimore Ciounty, Maryland. 

'We wish to thank the rraclicrs in Stcincliigh School for tluir help wich this research. 

We also wish to thank Ur. James Tcnnusy for many helpful suggestions. 
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of elementary school students stating tliat he had positive expectations would lead 
to improved performance on the next test. The question at issue is whether 
expectations of a school principal conveyed in a definite but rather distant 
manner will lead to improvements in performances. Is the school principal, as 
well as the teacher^ a "significant other" in students* eyes? 

EXPBRIMCNT 

Teachers in a white middle-class elementary school in the suburbs of 
Baltimore gave a routine test in arithmetic. A mark was assigned the paper 
by the teacher (A, B, Q H). The papers were then collected and given to 
the principal, who had announced to each class before the first test (17 
classes. Grades 2 through 6) that he was going "to look over the papers" and that 
he would be "writing conunents on some of the papers." For each letter-grade 
stratum a random division determined those papers to receive comments and 
those to receive no comment. Standardized comments (sec the list below) 
calibrated to the mark the paper was given were placed on a random half of the 
papers in each class. 



Grade Given Paper 
by Teacher 


Conunent Added by Principal 


A 


I IcDOw you will do as well next time. 


B 


Maybe next time you can do even better. 


C 


I feel you can do even better than this. 


D 


I know you can do better chin this on the next test. 


B 


I am sure you can gyt a better grade on the next test. 



The comments were phrased to emphasise cxpcciaiions. Comments directly con- 
veying reinforcement (pruise) were deliberately avoided. Within a few days 
the papers were returned to the students and shortly thereafter a second arithme- 
tic test was administered by the classroom teachers. Scores on this second test 
are the criterion. The two classroom tests were tests that teachers would be 
giving routinely. That is, in all grades in this school every week or two, arith- 
metic tests are routinely given to check pupils* progress and to provide feed- 
back for the teacher and students. The experiment was superimposed on this 
ordinary test-giving pattern. Some teachers gave tests 1 wk. apart, others 2 wk. 
apart. The tests covered material that was being taught during the time of the 
experiment and this of course differed from grade to grade and also from 
teacher to teacher. This kind of testing, since it is part of standard teaching 
practice, provides a suitable vehicle for checking effects of comments. If we had 
tried instead to aher normal teaching practices any effect of comments would 
be confounded with other alterations introduced. 

Prior to the experiment, teachers were asked to assign a "suggestibility" 
rating to students in their class on a 3-point scale as follows: (1) "highly re- 
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spofisive;" (2) "moderateiy responsive;" (3) "no obvious rapoose;" (4) 
"highly reactive" (negative); and (5) "highly reactive" (positive). This vari- 
able was included on the supposition that some children inighc be unresponsive 
to comments and therefore that the experimental treatment would be ioeffective 
for them. 

Test scores were transformed from letter grades to numerical values (A =: 
+2, B = -HI. C s: 0, D s: -1, E = -2). The zenH>rdet correlation matrix 
presented in Table 1 shows that variable 7 (Treatment) does not correlate 
significantly with variable 9 (2nd test score) when boys' and girls' scores ere 
grouped together. However, the correlation between treatment condition and 
posttest score for boys is .16 (« — 217), significant at the .02 level, and for 
girls is —.05 (» = 223), not significant. (The correlations between pretest 
scores and treatment are only .02 and —.05 for boy:' and girls respectively.) 
For the 17 classes, if test gains are compared by sex for the comment and no- 
comment conditions there is an average gain of 2 units by class for boys and a 
decline of almost 1 unit by class for girls. A t test of this sex difference shows 
that it is significant beyond the .05 level (/lo ^ 2.21, classes as the unit of 
analysis). The difference between test scores of individual boys in the comment 
and no-comment groups is about half a letter grade. This difference of half a 
letter grade for boys brought about by the experimental treatment is not hvge 
in absolute terms. On the other hand, it followed a "small" ttcatment, the 
writing of a single comment on one classroom test. 

The experimental treatment did not interaa with the students* rated re- 
sponsiveness. That is, those students rated by teachers as "highly responsive" 
or "moderately responsive" did not appear to be any more affected by a princi- 
pal's comment than students rated as "not responsive." As one would expect. 



TABLE 1 

ZBXO-OROBR CORKBLATIONS BBTWBBN VAUABLBS, GRADES 2 TO 6 (N s 442), 

Afiq = IIJ 



Variable 


1 2 i 


4 


5 


6 


7 8 


9 


1. 1Q» 


J8 .19 


.98 






.18 


M 


2. Ik semester math grade* 


.66 


.52 


.27 




.47 


AJ 


3. Effort Grade* 




27 


.4) 




.32 


.37 


4. Math Achievement* 










.36 


.37 


). Suggestibility** 








.14 


.24 


iO 


6. Sex 














7. Treatment (comracnr vs 














no comment) t 














8. 1st Test (transformed )t 












9- 2nd Tcsr (iransformcdt 















No/^.<— Only correlations significanr at or beyond the .01 level arc listed. 

^From Khool records. **Teachcrs' judgmenis. tExpcrimenul treatment. tGitdes «»* 

signed by urachcrs. 
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responsiveness as judged by ccdcliers correlaces wich sex (girls being more re- 
sponsive) and wich all ceachcr-assigned grades (firsc semescer mathemifics 
grade, effort grade, and grades on classroom cescs) buc noc wich scores on scan* 
dardiaed cescs (IQ and macbeniatics jschievemenc). 

Discussion 

The chief finding is chac boys' scores on a second arichmetic cest improve 
when che boys received a commenc from che principal conveying posicive ex- 
peaacions on a firsc cest, relacive co a concrol group. Girls' perfdrmance on 
second cesc was not affected by his commencs on che firsc cest 

The significanc correlation between experimencal crcatment and postcese 
score for boys, associated with an average increase of half a letter grade, must be 
weighed against several faaors, first the **smallness" of the treatment already 
mentioned. Second, arithmetic is a subject where trying harder may have little 
effea. In some areas like language arts, where trying harder might lead to 
longer or more elaborate rc.s|onscs, comments might yield more striking im- 
mediate effects. Third, the correlation between treatment and boys' posttest 
grades is much like the correlation between standardized tests (IQ and math 
achievement) and posttest grades. Unreliability of classroom tests attenuates 
relationships between these tests and other variables including experimental 
treatment. 

The present study is consistent with earlier work. Page reported a signifi* 
cant difference in favor of comments, but his experiment differs in several im* 
portant ways from the present one— in panicular there is no breakdown in terms 
of sex so one cannot tell whether there was a differential effea for boys and 
girls, and comments were delivered by teachers rather than by the principal. 
It is hard to tell how large Page's comment-effect is. Apparently, triplets of 
students in che 3 experimental conditions (personalized comment, standardized 
comment, no comment) were ranked 1, 2, 3 according to performance on the 
first test and then re-ranked on the basis of the scores on the second test. Fbr 
. 74 classes with about 30 per class, che largest difference ia mean rank between 
means of treatment groups is 0.18. This could stem from small differences in 
absolute performance level although no precise statements can be made from 
Page's pubh'shed results. When our data are ranked in an analogous way, it 
appears that the magnitude of the effect in Page's experiment and the effea 
in the present study may not be far diifferent. 

Two plausible mechanisms could explain the differential effect of comments 
by sex. (1) As the principal delivering the comments was male, he might be 
persuasive for t^ys since most other school personnel were female. (2) In 
other research in the same school where expectations of children for their own 
performance were raised in a short experiment, boys were generally found to be 
more responsive to the experimencal expectation raising treatment than girls 
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(Entwisle & Webscer, 1972). This creacmenc involve in individuil lessioo 
with an aduh where all responses in a simple story-generacing task were heavily 
reinforced. Ic was choughc chat boys mighc be relatively deprived of praise from 
teachers— several workers repon that boys receive less praise and more blame 
from teachers than girls (Felsenthal, 1971; Jackson & Uhaderne, 1967; Meyer 
& Thompson, 1936) so that their expectttions initially are at a generally lower 
level and therefore easier to raise. The same line of reasoning can be applied 
here to explain the greater effectiveness of the principalis comments for boya— 
because they have received few positive comments chcy may be especially recep- 
tive. 

This research is related to an expanding body of finding! on teachers' 
expectancy (e.g., Meichenbaum, Bowers, & Ross, 1969) and suggests why some 
results from research on teachers' expeaancy could be equivocal or negative 
[We intentionally avoid raising issues about the methodological flaws others 
see in this research, for example, Thorndike (1969).] First, teadiers' expec* 
tancies represent expectations of one set of significant others, but expectations of 
peers, of parents, and as shown here, of other school personnel, are probably 
operating concomitantly. If expectations of various "others** are oontradictoryi 
they could work against one another to attenuate effects in studies of teachers' 
expeaancies. Second, research has so far paid little attention to the status char* 
acteristics of the person whose expectations are altered. A middle-aged female 
teacher may have low credibility for a 10«yr.*old boy, so that her increased expec* 
rations for him arc relatively impotent in affecting his behavior. Much further 
work needs to be done on exactly how cxpeaations are transmitted. The present 
study is a small step in that direction. 

To sum up: expeaancy comments by a school principal (male) apparently 
are effective in improving boys' performance. The size of the inaease, although 
statistically significant, is small in absolute terms. Whether such an effea could 
be produced rcpcaicdiy for the same students and whether the effea would 
persist should he studied. 
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PROCEDURES THA'^ TURNED OUT TO BE UNSUCCESSFUL 



Murray Webster, Jr. 



.1 ) 



Procedures That Turned Out to be Unsuccessful 



We report here three different procedures which we found to be 
unsatisfactory for the study of classroom es^pec tat ions, or for intervention 
in the natural course of development of expectations. Each procedure 
was attempted in the course of a particular phase of our investigations; 
that is, it was developed for a purpose related to the conceptualization 
presented in the body of Section I. We describe them here sequentially, 
as they entered the course of the research program. 



A. The Artistic Judgment Task 



From the beginning of the experimental program, we have been concerned 
with studying the sources and the consequences of es^ectations in general , 
rather than studying expectations for some particular skill such as 
reading, or story telling ability. As we noted, because of the existence 
of the story telling task developed in previous research, we first sought 
to adapt this task for our expectation raising experiments. However 
ifflfflediately after the first successful attempts to raise expectations 
with this procedure, we began to search for other tasks which might be 
used for the same purpose. Our thinking was that tho early demonstrations 
of our technique would be more persuasive if they could be shorni to apply 
to more tasks than the story telling. However as we noted earlier, 
because of the stringent set of requirements which a task must meet for 
this work, task development is not an easy problem. 

One task which seemed promising we called "Artistic Judgment Ability." 
Artistic Judgment Ability met all the requirements for our cxperlments-- 
such as being interesting to children, consisting of discrete performances, 
permitting either posit ive or negative evaluations of any given performance-- 
€uid it offered the additional virtue of being quite different from the 
story telling task. Q.'hus we could argue if the experiments with it 
worked that the procedure for raising expectations was probably a very 
general one, not tied to a particular sort of task. 

During the fall of 1971 we spent considerable time in pilot studies 
of Artistic Judgment Ability. The basic experimental design for raising 
expectations already described was used with only the task differing. 
Artistic Judgment Ability was presented to children as being the ability 
to discriminate artistic merit in drawings; to tell good drawings from 
bad onca, in other words. Children were shown a series of figures like 
Figure 1 and were asked to "grade" them from A to E. This presentation 
was done in the classroom, using booklets of 10 drawings. Booklets were 
collected, suposedly to be graded by the experimenters, and after a time 
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were returned to the children. They were also at that time told that 
their Artistic Judgment Ability was either "very good/' or "average." 

Later that same day, groups of four children were assembled, containing 
one "e^gperimental" child who had been told she was very good, and three 
"control" children who had received scores of average. Subjects for 
these tests were white middle-class girls, third and fourth graders. In 
each group, 12 new drawings were shown to the group, and children asked 
to volunteer to grade them. 

Structurally, this experiment is the same as our basic e^^erinent: 
First, the attempt is made to induce high expectations in one randomly 
selected (esgperimental) child. Then the experimental child is put into 
a group with three children of presumably average ability, they all are 
given a set of 12 action opportunities, and are given the chance to emit 
performance outputs. The performance outputs, or problem solving atteoopts, 
are then recorded for each child and used as the measure of expectations 
held. The prediction is that the child with high induced esgpectations 
will be more likely than control group children to accept a given action 
opportunity and attenxpt to perform. 

Results were essentially zero: our analyses show no effect of the 
earlier manipulation procedure. We ran a few groups to see whether the 
time lag between manipulation and measurement phases could be responsible- 
children might have forgotten their supposed ability levels— but that did 
not seem to be the case. Also, interviews with children usually elicited 
strong evidence of remembering what ability we had assigned to them. 

Of several possible reasons for the failure of this task, the one 
which seems most reasonable to us is that "artistic Judgment" is socially 
defined to be a skill difficult to measure and without universallstic 
standards. One person's artistic opinion is considered as good as the 
next person's, that is, there are no clear distinctions between "good" 
and "bad" when it comes to art. Some children volunteered ideas similar 
to this one, though in most cases they simply said they saw no reason to 
volunteer more often than anyone else in the group setting. 

VThat this interpretation indicates, of course, is that Artistic 
Judgment fails to satisfy the major scope condition for Expectation 
States Theory, namely, task-orientation. Artistic Judgment seems to belong 
to the realm of process orientation (in Bales' terms), not susceptible to 
clear or objective assessment. The meal planning task which werlater 
developed and used successfully meets the same sorts of criteria which 
prompted us bo develop Artistic Judgment, and it turned out to be much 
more successful. 



B. The Allocating Action Opportunities Es^erlment 



During the summer, 1972, we began to get interested in effects of peer 
expectations and behavior on the expectations children develop for them- 
selv<>s. On an intuitive basis, it seemed clear to us that children do 
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develop ideas of how good each other is at various school tasks, and 
they show these Ideas in a variety of ways. One Ijqportant behavioral 
consequence of expectation states held for others is predicted to be 
differential allocation of action opportunities: all other things being 
equal, the theory predicts that individuals will give chances to perform 
more frequently to those others for whom they hold high expectations 
than to those for whom they hold low expectations. Also, our theoretical 
analyses of classroom interaction indicates that the differential 
allocation by the teacher of chances to perform is one of the main ways 
in which he or she conveys the expectations held for individual students, 
(Such a process operates, we believe, in the famous "Pygmalion effect,") 
The allocation of action opportunities experiment was designed to 
deomonstrate this effect. 

The experiment devised was a modification of our basic expectation- 
raising experiment. Groups of three children were assembled for the 
story telling task, and one additional child— whose later behavior was 
to constitute the dependent variable measured— observed the group make 
up a story. Phase I of this e^eriment differed from the basic design in 
that every performance given in Phase I was evaluated publicly by the 
experimenter. One of the three children received consistently positive 
evaluations; the other two received about half positive and half negative 
evaluations, (These performances were not volunteered; each child was 
selected in turn,) The intent of this procedure was to raise the 
"observer" child's expectations for one of the three children relative to 
the other two. In Phase II, the four of them were defined as a "team," 
in coiqpetition with other teams, and with the former observer as Captain, 
As each sentence of the story telling task was read, the Captain would 
decide which of the other three children would give the team's word. 
There was no opportunity to volunteer* The prediction, of course, was 
that more action opportunities would be given for whom higher expectations 
were held. 

What happened was that the Captain invariably distributed action 
opportunities on a strictly equal basis. Sometimes he called on each 
child in rotation, sometimes alternating, but always with striking equality. 
At first we thought this might be due to lack of task orientation, so we 
increased the competition between teams by giving prizes and by making 
the other teams visible to the children before and after Phase II, We 
also thought the number of children might be too large for remembering 
expectations, so we reduced the "performing" group to two members. We 
also ran some four-person groups. We investigated the effects of natural 
status by making the Captain older than the performers (fourth grader 
with second graders), and by using children all of the same age. None 
of these variants in the social composition or the structure of the rules 
produced any deviation from strictly equal distribution of action 
opportunities. 

We conclude that the failure of this design was due to operation of 
very strict norms of "parity," "fairness," or "equity," as they are 
variously called. Basically, in our culture, there are rules that every- 
one on a team should get a chance to play, und there are rules in the 
school .".ystem which dincouraKO makinf: differential evaluations of 
individual::. ( rt Koes without t!uyin« that Uiesc are hypocritical norms, 
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publicly espoused but seldom actually followed.) For our Captain to 
allocate action opportunities differentially, he would have had to 
violate these norms, and apparently this was not possible for him. 
Incidentally, during the course of our attempts to develop a successful 
allocation experiment, we learned of problems Joseph Berger had had at 
Stanford In the social psychological laboratory where he was attempting 
to devise an allocation experiment. Even with more control In that 
setting, and with a college age population, no satisfactory means was 
found. 

The final set of our classroom experiments (in which children "grcule" 
stories supposedly made up by other children) deals with a closely related 
Issue, and It was successful. A second consequence of espectatlons states 
held for others Is differential allocation of performance evaluations , and 
althou^ here too we had problems with the "fairness norms," this procedure 
eventually was quite successful In producing the predicted results. To o\ir 
knowledge, however, there exists at this time no successful demonstration 
of the theory's predictions regarding allocation of action opportunities. 



C. Measurement of Self-Esteem 

For the longitudinal study of the natural course of esgoectatlon 
development (described fully in Section II of this report). It was 
desirable to develop ways to measure expectations children held for 
themselves and others at the tasks they actually perform In the classroom. 
The educational literature contains numerous measvrement techniques— 
typically, these are developed hy a single rpsearcher, not particularly 
well validated, and never appear again. However because of the obvious 
advantages of adopting a standardized measurement, we tried out several 
of these for our own work. 

Beginning before this project was undertaken (July, 1971), we worked 
with a procedure to assess soclometrlc ranking of children by asking them 
to choose each other for hypothetical teams which they were told would 
then compete for some prize. Every child in the classroom Is asked to put 
on a paper hat. The hats each have three colors ^ for example, red at top, 
blue next, and green brims. However the order of colors may differ, so that, 
for exeunple, a third of the children have hats with red at the top, a third 
with green at the top, and so on. 

Children are lined up so that they can see everyone else In the room, 
and they are asked repeatedly to write their first three or four choices 
In order— choosing first from those with r«d at the top, next from those 
with green at the top, etc. Some previous literature has also reported 
putting numbered signs aroimd each child's neck, and sometimes In addition 
numbers of hats are changed during the procedure. 

Mathematically, it Is possible, with a small number of repetitions 
of this procedure, to determine the entire soclometrlc ranking of a large 
class of children. Operationally, It was impossible. Children quickly 
became bored with the procedure, they changed hats without our knowledge, 
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they didn't follow instructions about which group to rank from, and in 
general both the actual operation and the results were too confused to 
be useful. With the wisdom of hindsight, we are not surprised by this 
outcome; we are, however, amazed that other investigators report successful 
use of this technique. 

Another technique with some precedent in the literature is the Draw 
A Man task. Children are asked to draw or copy a series of stick figures, 
euid their drawings then are scored according to presence or absence of 
certain graphic characteristics (such as elaborated vs. simple, with heuids 
vs. no hands, etc.). The technique Is Intended to operate similarly to 
other projective measures such as the TAT, and some investigators have 
reported success using it to measure self-esteem. 

After some attempts to use the Draw A Man task with children of 
different Eiges (first thought fifth grades), we concluded that it was 
not useful for that purpose. The most important factor in this conclusion 
was extremely low test-retest reliability and inter-item reliability. 
However in addition, children's overall self-esteem scores on this measure 
did not correlate well with anything else known about them, such as grades, 
popularity as Judged by teachers, or other measures of expectations. 

At this point our view of measures of self-esteem is that even if 
a measure with good reliability could be found and applied to different 
social groups, it would not be useful for the sort of theoretical and 
empirical work we are engaged in. The concept "self-esteem" is given a 
wide variety of meanings in the social psychological literature, and 
there usually is little overlap between what different researchers mean 
by it. What is notably lacking from all the self-esteem literature of 
which we are aware is the two properties which are central in our 
conception of expectation states ; namely, es^ectations are relative to 
a particular task rather than, general ideas of coispetence, and they are 
relative to a particular social setting rather theui being fixed ideas an 
individual carries around with him wherever he goes. Lack of specified 
tasks and specified comparison others has produced some very strange 
findings in previous work, as we noted in Section I of this report. 

Hhe procedure we eventually developed, where a child or a parent is 
asked to estimate the child's future report card in reading, arithmetic, 
and conduct, is relational in both the senses above. In addition, it has 
been shown to produce high test-retest reliability, and expectations of 
parents and children as measured by this procedure are predictive of a 
large number of other behaviors and performances, as we show in Section 
II. 
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SECTION II 

OBSERVATIONAL STUDIES OP CHILDREaf'S EXPECTATIONS 
Doris R. Entwisle and Leslie A. Hayduk 
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Chapter 3 Cohorts 1 and 2, Middle Class 

Chapter 6 Lower-Class Children 
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CHAPTER 1 
INTRODUCTION 



Sociologists pay littlsi attention to persons below the age of 
adolescence. Modern society, nonetheless, thrusts children outside 
the family early when children become members of day-care, nursery, 
or kindergarten groups. These groups are structured, with memy of 
the characteristics of adult work groups. Most children commute daily 
to their groups, eat at least one meal with their groups, obtain 
various medical and health services under group auspices, and find 
considerable entertainment and recreation, as well as "work," with 
their group. 

Nothing much is known about a child's early escperiences in group 
life and group activities— his sociological upbringing, if you will. 
How does this upbringing affect his later functioning as a group 
member or affect the nature of adult social groups? Theorists like 
Mead and Cooley who talked of the "looking-glass self" supposed that 
the developing person shapes his self-image, especially his ideas 
about his own ability, from reactions others display when he performs 
or interacts with them. A young child's social matrix outside the 
family provides the first looking-glass. But a detailed account of 
how social process shapes the child and how a child in turn shapes 
social process remains to be given. 

In the first part of this report we have presented experimental 
studies which show explicitly how some social processes affect children's 
expectations. (Expectations can be roughly defined as a child's concept 
of his own ability In a particular activity.) In small groups of four 
children our experiments show that a child's expectations for his own 
performance ceui be raised by a suitable adult. Related e^qperiments show 
that an Improvement in expectations for one task spills over to improve 
expectations for other tasks even when the two tasks seem unrelated. We 
also show that children themselves shade their Judgements of the quality 
of another child's performance depending on whether they believe that 
child has previously performed well or poorly. Exactly the same per- 
formance, in other words, gets a different evaluation depending upon 
the Identity of the person emitting the performance. Others (Finn, 1972; 
Seaver, 1973) have shown a similar shading of Judgements by teachers 
when they possess prior Information which gives rise to e3Q>ec tat ions. 
A child's early groups, then, to the extent they resemble our exgperi- 
ments, could build a child's self-image to be consistent or inconsistent 
with his ability. 

Our escperlments aimed at specifying exactly how expectations may 
be changed and by whom. They are, in short, an attempt to specify in 
fine grain how the social "looking-glass" operates to lead a child to 
form ideas about himself and his abilities, and what the consequences 
are in terms of the child's performance. 



A noteworthy fact which should not be overlooked althou^ we have 
paid no attention to it in our es^erimental work, is that children's 
expectations are not null at the start of our esgperlments. Children 
come to our experiments with esgpectations based on whatever has happened 
to them in the past. Each has a life history filled with events that 
forecast success or failure for the future, that set the level of 
expectations for success at all kinds of activities the child has not 
yet engaged in, and that cause the child to choose to perfom or not 
to perfom. Some children come to our experiments with expectation of 
doing well even though they have never done anything exactly like this 
before. Other children come with pessimistic views about doing well. 
The two act very differently. How do they get this way? 

iniis, the second part of our report, attenpts to trace out early 
life events that may shape expectations. Observational findings will 
be presented. Ttie puzpose of the observations is to trace individual 
children, starting in first grade and sametimes a little before, to see 
what happens to their expectations as they go through the early years 
of school. When children are in first grade how do they think they will 
do on their first report card? After the first report card, do they 
modify their expectations? Are parents' and peers' esgpectations 
inportant in shaping the child's expectations for himself, and If so, 
how important? Do expectations for children in mixed racial groups 
follow a different pattern or the same pattern as in segregated groups? 



What Are Expectations? 

Expectations for the self are a set of beliefs loaded with affect 
which may or may not correspond with reality. They provide one of the 
chief social linking mechanisms between members of a social group. As 
Hamburg (I963) says: 

Man is' by nature committed to social existence. . .man's self- 
interest is best sexnred through his commitment to his f ellows . . . [ j ] 
[his] need for positive affect... may be expressed. . .as a desire 
for. . .recognition and acceptance, for approval, for esteem, or 
for making persons. . .act in such ways as to attain the approval 
of their fellow men. 

Each persons craves response from other humans and,, as others have often 
noted, there is an insatiable need for these responses to maintain and 
enhance the phenomenal self. Expectations for the self, then, are the 
set of beliefs an individual holds about himself as a consequence of 
social feedback from others, and they span the whole set of attributes 
that contribute to self-regard. 

When the child emerges from the protective circle of the family he 
begins to get social feedback from new sources— teachers, other school 
personnel, or a wider set of playmates and peers in school— as a response 
to new sorts of activities. He is responded to and evaluated on the 
bacis of his own uchlcvcraentfl, things like his ability to please the 
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teacher, to iinpress his age-mates at ball playing or running, to wear 
clothing other children admire, to forecast others' responses to him, 
and, very much, to achieve in those academic areas like reading that 
the culture values highly. The standards of Judgment may be very un- 
clear to the child (as our data indicate) but he is not in doubt about 
whether others are according him esteem or barely tolerating him. He 
internalizes these evaluations and uses them to structure his expecta- 
tions for himself. 

Once the child enters school the process of social comparison 
begins in earnest. His expectations for himself can be the bulwark 
supporting the acquisition of con^petent behaviors if evaluations from 
others are supportive. Favorable other- evaluations are translated 
into favorable self-evaluations. As will be seen, our data suggest 
that most children have very optimistic expectations for themselves 
when they start school— the vast majority (80^ or so) eaqpect to get 
the highest possible mark in reading. Obviously, however, the process 
of social comparison becomes more discriminating as the social reference 
group enlarges. In the family what little social comparison there is 
on the basis of achievement is mitigated by ascriptive characteristics 
<the U-year-old is not expected to perform as well as his older sibling) 
or even distorted by ascriptive characteristics (the pretty child may 
be more warmly regarded than the ugly child even though both perform 
to the same objective standards of excellence in behavior). 

In the school ascriptive characteristics are less important 
because the arena of behaviors is greatly enlarged by the intrusion 
of achievement behaviors, reading, arithmetic, and the like, so the 
basis for social comparisons differs. More important, the numbers of 
individuals involved in the social comparison process greatly expands 
so that feedback now comes from many more sources and evaluations are 
made with respect to many more fellows. Only one child can be "best 
in reading so with 30 children in a class, 29 suffer some loss of 
reward. There is some experimental evidence (Gibby ^jad Gibby, 19o7) 
that failure causes a drop in the self concept and a perception of 
being less well- regarded by significant others. If failure is less 
than complete, i.e. if eomeone else outdoes a child in a given activity, 
the child may still perceive himself to fail, as he does in a relative 
sense. It is little consolation for a student who gets a C in ^♦th' 
year French to realize that he can understand French better than 99^ 
of Americans. He is concerned with his performance relative to other 
Uth-year French students. Similarly the child in a middle-class school 
who receives a low mark in reading in first grade will think of himself 
only in terms of others in his class and their marks. 

There is also the potential, with more evalutators, for evaluation 
to differ. A little evidence (Keren sky, 1966) indicates that pupils' 
self-perceptions can differ significantly from teachers' perception 
of them and that the two have different sets of expectances regarding 
school behavior. As will be seen, we find that average expectations 
of parents and teachers are equivalent but that child-by-child the parent is only 
litUe better than chance at predicting how the teacher will evaluate his 



child. It is very easy, therefore, for the parent to eaqpect an A while 
the teacher expects, and awards, a C. 

In any case the early days of school plunge the child into a new 
environment of social comparison Euid the net residue of these comparisons 
shapes his evaluation of himself, what we call his "eacpec tat ions." His 
views of himself are thougjifc to be a crucial component in coinplex mental 
processes, filters if you will, which color and even forecast his 
experiences. Lamy (1965), for example, found for 52 first-grade students 
in a laboratory school at the University of Florida that self perception 
in kindergarten predicted reading achievement in first grade as well as 
IQ, and Wattenberg and Olif ford (196U) could predict reading achievement 
two and one half years later from measures of self-concept procured in 
Itindergarten. The later study indicates^ besides the importance of 
self-concept for reading, the causal priority of it. 



Why Study Expectations? 

Several large-scale studies make the consequences of holding high 
or lo\/ expectations abundantly clear. In the famous Coleman (1966) 
Report, for example, minority group members revealed that they had low 
expectt'.tions for their own effectiveness in controlling events in their 
own liv^s and it turned out the lower their feelings of control, the 
poorer their academic records. There was r.o direct evidence available, 
however, as to how and when these ideas were acquired. Also In Brookover's 
(1962, 196U, 1965, 1967) longitudinal study of students from the seventh 
through the twelfth grades, students' self-conceptions of ability were 
shown to predict 'performance better than IQ. We do not know how these 
self-conceptions of ability were Initially established, or what led some 
students to have high self-concepts and others low. (Shaw, et al., 196O, 
also does a correlational study of self-concept Guid academic achievement.) 

Surprisingly, there are no studies which focus on academic self- 
concepts of younger children or on how academic self-concepts are established 
in the first place. Lesser (1972) says there must be important effects 
when "the child exhibits his elementary skills like naming letters or 
numbers in the presence of someone who cares about him and receives 
attention and admiration. " But how in fact a child develops as image 
of himself as a competent and effective, or incompetent and ineffective 
person is shrouded in mystery. 

Young children are much harder to study than older ones, and thie 
in large part is responsible for the dearth of research upon expectations 
and aspirations of children in their early achool years. What is to be 
gained from this added effort? The answer is "much" on two counts. 
First, folk wisdom and a wide array of scientific evidence suggests that 
by the end of third grade a child has developed a fairly complex and 
stable self-image. Performance levels at that time are surprisingly 
good predictors of subsequent performance. In Husen's (1969) large 
cross-national study, for example, intelligence scores and teachers' 
ratings in third grade were very good predictors of subsequent educational 
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careers. Personal style and social interacts skills may also have 
considerable stability after third grade, for Kagan and Moss (1962, 
p. 272) note that "the first four years of contact with the school and 
peer environments. . .crystallize behavioral tendencies that are main- 
tained through young adulthood." Many workers Including Kraus (1973) 
guess that a child's e:)qpectatlons are fairly stable by the end of third 
grade. One question the present research addresses then Is: Do 
expectations become stabilized early In the school career? 

Second, If one wishes to liqprove expectations, what paths of action 
are apenl In the experiments where children's expectations are raised 
we show that positive evaluations by a suitable other are sufficient 
to raise esqpectatlons. This way of raising escpectatlons was drawn 
directly from theoretical assertions and from closely related laboratory 
work with college students In esqpectatlon states theory. But we have no 
precise Idea of how young children's expectations are raised or lowered 
naturally In kindergarten or first grade. If a child gets a low nark In 
conduct, does this have any noticeable effect on his expectations for 
reading? Or if a child has higher expectations for his own performance 
than his parents hold for him, do his expectations decline or do his 
parents' expectations increase? To change children's expectations by 
having some kind of "expectation ralslug" session fashioned after our 
experiments might at times be feasible for children with unrealistically 
low expectations. One would think it preferable, however, for a child 
to begin school with high expectations for himself and then to have those 
expectations confirmed rather than to try to elevate expectations after 
they have been pushed down. It may be next to inpossible, for example, 
to improve a child's expectations for academic success by the time he has 
reached Junior high school if he has experienced little success up to that 
time. 

Utie reason is clear. Once the self-fulfilling prophecy is put in 
motion It tends to perpetuate Itself. If a child's first report card 
rates him low on reading, his expectations may be shattered. In addition, 
hw may then be assumed by his teacher to "be slow," and given little 
time or opportunity to prove himself. His classmates may not pick him 
to be a participant in games or contests Including reading. Our experi- 
mental data (Webster & Entwisle, 197^) show clearly that expectations 
influence evaluations and the same performance gets different evaluations 
depending upon the presumed ability of the performer. Also, Seaver's 
(1973) study shows that expectations first-grade teachers hold because 
a child follows his "bright" or "dull" sibling in the teacher's class 
are iiQportant determinants of evaluation. There is some limited evidence 
that expectations for the future reflect initial levels of success 
(Adelman, 1969) and also future achievement Itself (see Finn, 1972). 
Because of one event, say a first mark that happened to be low, both 
the way a child is treated and the kind of treatment he comes to expect 
may be altered. Such events could have profound implications for a 
child's academic career. 



Two kinds of effects ei^phaslze the Iniportance of expectation 
setting. First, considerable evidence exists (Rist, 1970; Palardy, 
1969; Lambert, 1970 ) that teachers' expectations are not "open to 
evidence." Some teachers (see Palardy, 19^9) expect boys to do as 
well as girls in reading and boys in their classes do tend to do as 
well as girls. How children behave or perform, in other words, may 
depend less upon their natural talent and effort than upon the expecta- 
tions that shape teachers' responses to them. Second, although teachers 
can form esgpectations on other grounds than objective test scores, 
as the Rist (1970) and Williams' (1970 ) studies show, there is only 
a slight relation, if any, between IQ's obtained early in the school 
career and early performance levels. Forming expectations on the 
basis of IQ test scores may not be any fairer. As the reader will 
see, when first-grade middle class children in this study are sorted 
into quartiles according to IQ test scores, there is no relation 
between IQ and early reading performance. Tltias it does not make sense 
to recommend that teachers group children on the basis of tested IQ 
rather than body odor— it might not be any "fairer." Rather each child 
is entitled to have his high e^gpectations not destroyed so they will 
lead him to perform and to learn. 

There is considerable confusion about subverting "academic 
excellence" and the use of ranked grading systems. The seventh- grader 
who performs well is likely to be the child who has performed well 
in sixth grade, fifth grade, and all the way back to kindergarten. 
But surely the curriculum of the elementary school is within the mental 
grasp of all children except those with marked and pathological retarda- 
tion. There is no reason to think any child cannot learn the multi- 
plication tables, or considerably more conqplex materials, under "proper'' 
conditions. Proper conditions may include having expectations for a 
reasonable degree of success. 



The Social Context 

There are two major lines of research that point to the ii^portance 
of the social context in measuring expectations. One is the perplexing 
and often contradictory results in surveys of the level of self-esteem 
of black or disadvantaged ckhildren, viewed in the perspective of 
Rosenberg and Simmons' large cross- sectional survey over several contexts. 
The other is a conflict in findings between some experiments of Cohen's 
and of our own concerning how black children perform in mixed- racial 
Interaction. 

Some studies of social interaction in mixed racial groups of children, 
all with older children, indicate that blacks suffer from interaction 
disability (see Cohen, 1972; Cohen, et al., 1972; Cohen and Roper, 1973; 
Katz and Benjamin, 196O; Katz, Goldstein, and Benjamin, 19^8). These 
studies concern older subjects (Junior high age or older) than those 
In our experiments and the findings run somewhat counter to what we 
observe in our experimental studies with younger children. We did not 
find that black children who attended an Integrated (6o^ black) school 
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from kindergarten on were disabled In Interaction with whites In 
mixed racial work groups (see Entwlsle & Webster, 197^b), We found 
coo^arable volunteering rates and equal susceptibility to e:Q>ectatlon 
raising. It Is liQportant to study how young children's self-expectations 
develop In natural settings for two reasons, (1) because our observations 
of blacks In mixed racial groups differ strikingly from Cohen's and 
(2) because no one has ever before looked to see how e?gpectatlons 
develop starting at the time when the child enters school. 

A word Is in order here concerning the use of the concept expectations 
In this analysis. By "e^cpectatlons" the authors mean a realistic guess 
of an evaluated performance. In particular a child's ei^ectatlon refers 
to the child's guess about the mark he will receive— his scholastic 
performance as evaluated by the teacher. A parent's estpectatlon like- 
wise is the parent's guess concerning the teacher's evaluation of the 
child's scholastic -Performance, the mark his child will receive. The 
terms "forecast" and "hopes" occasionally appear In this analysis in 
place of "expectations" where they provide an easier Intuitive grasp 
of the data; but by such terms the authors do not Intend anything other 
than "ejQ>ectatlons . " 

Self-esteem, which one might term the global set of self- expectations 
and the affect surrounding them, has so far been difficult to measure. 
A fundamental stumbling block is the lack of a handy validating criterion. 
Who but the person himself is party to his feelings of self-confidence, 
his pride In ownership or accomplishment, his hopes for the future, and 
his memories of past successes or failures? What Indicants of these 
exist outside the person himself? If in a classroom a child raises his 
hand often to volunteer answers one might think the child had high 
self-esteem. Unfortunately, however, volunteering may indicate a need 
for attention or a need to impress classmates Just as well as self- 
confidence. In the case of intelligence, defined narrowly for purposes 
of test construction in terras of performance in school, a handy measuring 
rod exists. But self-esteem has no such hand^' yardstick. It Is hard 
to specify criterion behaviors or the evidence to use In measuring 
self-esteem. Dickstein (1972) sensibly chose a multiple criterion, 
peer popularity plus academic performance, and found validity coefficients 
in the range from .hO to .50. Such validity coefficients are much lower 
than those customarily found for cognitive tests. It would probably be 
a mistake, however, to insist that a self-esteem measure correlate more 
highly with this multiple criterion, for self-esteem obviously has many 
more facets than the two represented in the criterion. 

Self-esteem, as William James pointed out, can be thought of as the 
quotient of the individual's successes compared to his "pretensions." 
Another way to express this is to define self-esteem as the discrepance 
or residual between achievements and escpectations. If a person has high 
expectations and achieves well, then the discrepance is small. This kind 
of person will have higher self-esteem than the person with high expecta- 
tions who achieves average or below and has a large discrepance. On the 
other hand if the person has modest expectations and achieves at a high 
level one would expect his self-esteem to be higher than the person who 
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achieves high against a background of high expectations. Such analyses, 
although they may have some relevance, need to be carefully balances in 
terms first of the individual *s information-processing ability and 
second in terms of what is "high" and "low." 

Persons process information about themselves to form an evaluation. 
Becuase of a possible need to preserve and enhance the self, negative 
information is dovmplayed or perhaps even ignored. Thus a child who 
does poorly in arithmetic may literally lose sight of evaluations in 
this area, or turn his attention to other "more loiportant" areas. There 
is some evidence of this in the Rosenberg and Simmons study where black 
children with low IQ's actually report having parents who see IQ as 
less important for schoolwork than children with higher IQ's. Self- 
esteem as such may be undifferentiated in young children and based on 
factors that the group as a whole does not see as value. A young child 
as we know mainly from observation may place great value on an object 
that no one else values. He similarly may regard himself or his own 
characteristics in ways other persons do not share. He may be ill at 
ease because he has freckles, or because his speech differs in small 
phonological ways, or even because he is later in losing his deciduous 
teeth than his classmates. It takes some time for e^gserience to 
accumulate if he is to discover what the relevant patterns are and how 
he stands on them. Thus self-esteem early in the school career may 
easily be "lop-sided" or grossly inflated or deflated. As time passes 
however the child's self-esteem may converge toward an evaluation which 
would coincide with an evaluation made by significant others. In the 
early stages however his e3Q)ectatlons for himself in various important 
areas are probably not well- Integrated. Certainly for school subjects 
and for peer-group performance he often has insufficient data to Justify 
(i stable estijnate. Social context can be regarded as a feedback generator, 
but the amount of feedback obviously depends on how long the child has 
been in that context. 

What is "high" and what is "low" depend entirely on context. A 
child in a lower-class school who receives an A may be performing at a 
high level compared to other children in his class. Still in absolute 
terms he may be performing poorly compared to children in a suburban 
school or coiqpared to national grade norms. Ttiis is so obvious that it 
hardly needs to be said yet in studies of self-esteem in older children 
comparisons have been made across groups ignoring social context. To use 
a physical analogy, this can be likenod to saying that an object that 
weighs one pound on earth weighs one pound on the moon. 



The Model 

Bronfenbrenner (1973) has recently called for research on children 
that embeds the child in a social context and that examines feedback. 
He also points to a need for research in naturalistic settings. The 
present research addiesses both these matters. In addition, as the next 
chapter will specify at considerable length, the research is conceived in 
terms of a specific causal paradigm. The aim is to specify coDQ)onents 
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of a model, eventually specifying the components in a precise quantitative 
manner. The data analyzed for this report, although voluminous, are 
still not voluminous enough to allots many precise quantitative state- 
ments. On the other hand, they do suggest that such statements could 
be made by adding more data of the same kind, a task we are presently 
pursuing. 

Studies of status attainment in adolescents see the individual 
nested in a context of significant others where parental pressures, 
peer friendship commitments, and school (teacher) factors operate to 
shape vocational or educational aspirations. Our model of the e:qpectat ion- 
shaping process in young children is reminiscent of those models, with 
the additional coDq)lication that we measure variables at several points 
in time. 

The model underlying the design and analysis is a dynamic one. It 
is thought that the child's initial expectation level, shaped by home 
factors before he comes to kindergarten, and little influenced by 
"objective achievement levels," is heavily but diffusely influenced 
by what happens early in school (kindergarten). Major focused influence 
is exerted in first grade as achievements are evaluated by his teacher 
and his peers, particularly in the prime academic areas of reading and 
arithzoetic. In addition his parents, who heretofore have shoim concern 
for his performance in non-academic areas, now indicate that they have 
expectations for his performance in school, particularly in substantive 
areas like reading and arithmetic. As he performs and is evaluated, 
peers, parents, and teachers modify their expectations for him and 
convey this information to him by way of evaluations. Ihe diagram below 
summarizes the feedback processes believed to shape expectations. The 
process portrayed by the model operates over time . The time frame for 
measuring variables implicated in the process will be indicated in the 
data collection schedule. Ttie principal advantage in using a longitudinal 
approach is that it permits the teasing out of causal patterns. Only by 
observing that some consequences follow particular antecedents can one 
isolate causes. Changes in level, i.e., dynamic rather than static 
effects, are the main targets of the analysis. As the reader will see, 
discrepancies in expectations and performance are what lead to eustion 
or change, not the absolute level of expectations jger se. Failure to 
observe self-image in this way is probably one reason previous work has 
led to confusing results. Also, as mentioned earlier, expectations or 
self-image can be measured only in a social context, an approach pre- 
viously overlooked. Expectations are of necessity relative. 

Studies by Brookover and his associates (1962, 196U, 196^, I967) 
show that older children have different "self-images of ability" for 
different substantive areas, and our work here shows that first-graders 
have very different expectations for reading and for arithmetic. 
Accordingly, one can imagine the model operating for a child in 
reading, separately in arithmetic, and perhaps also separately in 
conduct, the three areas for which marks are collected from school 
records. The feedback process probably operates independently for the 
olillii's percept lon« of his own sociometric standing as well. 

■ / * 
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Children are drawn from three residential locales whose character- 
istics might cause the variables in the model to have different ioipacts 
(i.e., the path coefficients may differ depending on the social setting). 
Black children in an all-black context, for exaiqple, may respond to 
peer esgpectations more (or less) than black children in an integrated / 
setting. Or white children with a black teacher may respond less to 
teacher evaluations than white children with a white teacher. (Some 
limited data for black Junior high school students show less influence 
of peers' upon black children's expectations and a weaker relation 
between ability self-concept and performance for blacks than for whites, 
although for both races self- conception of ability was a better pre- 
dictor of performance than IQ. Morse in Brookover, et al., 1967*) 
Thus the dynamic model could operate fairly independently for the 
same child depending upw whether performance in reading, arithmetic, 
or conduct is involved. Also the model could assign different average 
values to factors like "peers' expectations" depending upon the broader 
social context of the child's social milieu. The model, in other words, 
may specify feedbacks to the child of different inportance depending 
on the child's social class or racial context. 

It is important to notice that by aggregating over several first- 
grade (or second-srade) saiqples from the same context a sufficiently 
large set of data can be obtained to perform the planned analyses. The 
writers feel that this aggregation is superior to aggregating first- 
grade cohorts from different schools because of the homogeneity of the 
samples from the seune school over short (2 or 3 year) time spans. So 
far there are two first-grade cohorts aggregated for the middle-class 
school which will be analyzed in Chapter 3* The lower-class school 
has only one cohort so far and results for it are given in Chapter 6. 
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CHAPTER 2 
METHOD AND DESIGN 



The Conceptual Frainework for the Design and Analysis 



Psychological and Sociological Paradlggns 

One way to conceptualize educational research Is by the unit of 
analysis. Traditionally educational psychologists have analyzed 
Individuals 1 for exaaqple, hov a person's perceptual skills affect his 
reading achievement. Sociologists of education, on the other hand, 
have analyzed social aggregates > addressing such Issues as the school 
performance of subcultural or minority groups. Both lines of inquiry 
look at outputs, achievements of the individual or of the social group. 
Both tend to skip over hovr Inputs are converted into outputs of achieve- 
ment, in particular what interpersonal events foster achievement. 
If middle-class children read better than lower-class children, 
exactly what happens day by day in a middle-class setting that brings 
about superior reading? What social processes or activities occur to 
account for the differences? 

This research tries to trace out how sequences of events or patterns 
of social Interaction lead children to take particular views of themselves 
as performers in school. The unit of analysis is the school child nested 
in a group of significant others, his parents, his peers, his teacher. 
19il^ xmit bridges both the psychological and sociological units of 
analysis mentioned above, and focuses on the articulation of the child 
with his group of significant others. The aim is to trace out hCM social 
factors and processes are tremslated into expectations of children And 
how th^se expectations, in turn, affect children's performance. The 
design and analysis draw upon both the psychological and sociological 
tradition of research. 

Like longitudinal research in psychology, children are measured at 
several time points. The child's e3Q>ectatlons for himself are assessed 
repeatedly from a time before he receives his first report card to the 
end of the second grade in some cases. Individual curves can be derived 
which show how a child's hopes for his own achievement change over time, 
indicating whether, for example, a given child's expectations remain 
stable or move. With expectations rising on the average over the first- 
grade year (as they do for instance in the white middle-class school) 
there is opportunity to see whether all children's expectations rise a 
little, or whether some rise a great deal while others decline slightly. 
It could turn out that a child's expectations decrease if his earliest 
expectations are not confirmed, or equally well that his esgpectations 
remain fixed and his performance improves. 

The most obvious advantage of longitudinal data is the one hinted 
at Just above: it does away with the confounding inherently present in 
cross- sectional data and in aggregated data. An average gain can imply 



many different kinds of performances at an Individual level. As has 
received much attention in the psychological literature, however, 
longitudinal data need careful study to separate out (1) effects of 
age, (2) effects of cohort and (3) effects of time of saiqpling. An 
exanqple will make this clear. A child who is 3 in 1970 is obviously 
different from a child of 6 in 1970 because of attained age, but also 
because the ^-year-old is drawn from a 196^ cohort and the 6-year-old 
is a member of a 1964 cohort. In addition data from both children, 
being taken in 1970, may differ in important ways from data procured 
in 1971 from ^ and 6-year-olds at that time. The conditions of study 
in 1971 differ twice (sampling difference) and the events intervening 
between 197O-71 differ from those between 1969-7O (cohort differences). 
So effects of age, cohort, and time of scuqpling all need specification. 
Fortunately in the present research the time- span is relatively short 
and the same school and neighborhood are used for different cohorts. 
It is therefore assumed that differences between first and second graders 
of the same cohort are largely age or experience differences rather 
than differences stemming from time of sampling. By looking at cohorts 
separately and neglecting time-of-saiqpling differences— looking at, for 
example, first graders in the same school in 1971 and 1972— to see if 
samples of the same attained age differ, we can estimate the size of 
cohort differences. As the reader will see, cohort differences, insofar 
as they are available (for one school only) appear small. This fortunate 
state of affairs allows cohorts to be combined and, neglecting time of 
sampling on a priori grounds, we then attribute changes to differences 
in age or maturation. 

The research also draws from the sociological tradition. Like 
sociological studies, this work studies contextual variables and tries to 
Impose a causal paradigm. Children's expectations at the end of first 
grade, for example, are studied in the context of (l) expectations of 
the child earlier that year, (2) parents' expectations, and (3) previous 
marks received. The number of cases involved (so far 1^ at the maximum) 
and the nature of the measurement (usually ranks) does not permit a full 
correlational or path-analysis type of treatment at this stage in the 
research. However, repeated cross- tabulations are carried out where change 
in ono variable is tabulated against change in another variable. Such an 
analysis can examine whether children with low initial expectations revise 
their expectations upward after receiving high marks, or whether marks 
and expectations change simultaneously so as to increase the agreement 
between them from one time point to the next. Or, to take another exanqple, 
if parents on the average expect boys to do more poorly thai, girls in 
reading, what is the course of particular boys' expectations whose parents 
hold very low expectations and how are the e:q>ectations of both parents 
and boys affected by marks the boy receives? Or, to take still another 
example, if black children have higher expectations at the start of school 
than white children, is this linked to higher expectations of black 
parents, black teachers, or peers for these children, and does this 
elevated state persist? 

Sociologists have taken particular interest in status attainment by 
adolescents because of its obvious links to classical problems in sociology, 
how social stratification is effected and what makes it persist, for 
example, or the converse problem of how and why social mobility occurs. 
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This research tries to investigate variables similar to those in the 
status attainment work but our data are drawn earlier in the life cycle. 
The principal "attainments" are performance in the core areas of reading 
and arithmetic. The present research Includes "significant others," 
parents, peers, and teachers, and the influence exercized by each Is 
assessed. This procedure closely peirallels the sociologists' strategy 
with older individuals. Also In parts of our analysis there in no Intent, 
as In the psychological- type studies alluded to earlier, to follow 
individuals one by one or to trace out Individual paths. Rather, like 
sociological studies of status attainment, the overall thrust is to 
evaluate unique variance e:gplained and to assign direction of causality 
between variables. The main difference between the present study and 
the usual kind of sociological analysis is that attainment level is 
measured several times (twice each school year), so the model is dynamic. 

The melding of the psychological and sociological approaches found 
in this research is novel and allows study of the dynamic causal model 
depicted in Chapter 1. The longitudinal method Is often extolled because 
it allows one to observe change In behavior as the individual grows. 
Obviously, however, time cannot be viewed as the Independent variable In 
the sense of causation. Rather this method allows study of sequential 
intra- individual variation. If hopes of children In one cohort assessed 
at the end of first grade are weighed against hopes of other children In 
second grade, one might conclude, other things being e^ual, that the 
average level of hope had risen or fallen. One might also calculate the 
correlation between hopes and marks within the first-grade saoople or 
within the second-grade sample. One could not, however, conclude that 
high hopes In the second grade were preceded (or caused) by high marks 
in the first. Thus the advantage of the longitudinal method lies In 
teasing out causal patterns not In detecting successive differences with 
greater power, an aim often cited by developmental psychologists and by 
sociologists who carry out parallel studies. 

Also it should be noted that aggregation of some kinds will be 
carried out— similar cohorts can be combined— but other kinds of aggre- 
gation will be avoided, largely on a priori grounds. Separate analyses, 
for exanqple, will be carried out for samples from segregated and integrated 
schools. Also In many instances, sex effects will be searched for by 
analyzing data for boys and girls separately. This kind of preservation 
of context and attention to the cluster of significant others leads to 
the discussion In the next section of nesting of subjects. 



Early Stages In the Life Cycle 

The present research measures variables In young school children 
which may be potent determinants of later achievement euid of ultimate 
status attainment. To our knowledge, t^ese variables have not previously 
been studied In a sociological- type cau.al framework. Several studies 
suggest that by the end of third grade the prediction of eventual attain- 
ment may be almost certain. Both national (Bloom, 196^; Kraus, 1973) and 
cross-national studies (Huaen, I969) show very little change in children's 
achievement levels in school after that time. These findings, it should 



be noted, are in accord with much folk wisdom as well. If this kind of 
early asymptote then exists, two conclusions may be drawn, (l) Status 
attainment may be predictable from points very early in the life cycle. 
Despite data showing that peers and the social context in high school 
may account for considerable variance in later attainment, these peers 
me^ be surrogates in a sense, carriers of influence for peers who inter- 
vened much earlier. If a person's level of educational attainment is set 
by the end of third grade then perhaps his peers before that time have 
shaped his tendencies to achieve. His peers later, in high school, are 
"influential" because they match his earlier peers in their characteristics. 
It is only because he remains in a fairly constant environment that both 
sets of peers appeeur equal. (2) Policy-oriented or action research should 
perhaps focus on school children at a much earlier point in the life cycle, 
before the point when eventual attainment becomes predictable. There may 
be little use in altering school or school environments after the early 
primary years if educational attainment levels are as stable as the studies 
quoted above suggest. 

The generally equivocal and discouraging results from Heads tart not 
withstanding, it may be that the primary school years are crucial. For 
one thincr, the Headstart programs concentrated on cognitive variables, 
langu e ""Is and the like. There was almost no effort directed at 
affective eut motivational variables, and little attempt to assess such 
variables. Expectations are a kind of affective variable. Early in the 
life cycle before much feedback has occurred, they may be very influential, 
and independently influential. Later expectations may covary strongly with 
cognitive variables because the kind of performance the child displays will 
lead to various kinds of evaluations. A child by the end of third grade 
may be fairly good at forecasting his own performance; his expectations 
then may reflect less his positive or negative feelings toward himself 
and the subject matter than his recollections of what has happened in the 
past. Also teachers may not evaluate him on a "clean slate" at each 
evaluation opportunity but come to e3Q>ect a certain level of performance 
and "see" that level of performance no matter what the child does. (A 
recent study reveals that teachers even carry over a "slate" from one 
sibling to the next sibling in the family (Seaver, 1973).) A number of 
considerations suggest that very early in the school career a child's 
experience could be crucial. 



The Importance of Social Context 

The social context germane for this longitudinal study derives from 
two separate theoretical orientations. The first, the nest of significant 
others, has already been linked in Chapter 1 to the developmental "looking- 
glass" of Mead and Cooley and to Sullivan's set of "significant others." 
The set of significant others has been an important and continuing concern 
in sociological studies of status attainment. The only difference in its 
use here is Its application to students who are much younger than those 
previously studied and who have Just emerged from the protective circle of 
the family. 
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The second theoretical strand, more macro- sociological In its roots, 
links this research to the stratification system of the society as a 
whole. The cross- sectional study of social class differences is firmly 
established as a research paradigm in both education and sociology, for 
example studies of school achievement over various levels of social 
class. The repeated demonstration of social class differences has 
reached a point of diminishing retuitis, however. Middle-class children 
are on the average better readers at evezy grade than lower-class 
children. This fact Is well established. What is not known Is the 
causal priority to be assigned to social factors leading to the associ- 
ation between social class and performance. One large and controversial 
question at the moment is the causal priority of genetic or environmental 
factors in school achievement. The authors wish to sidestep this problem 
and concentrate on other problems less sharply in focus at the moment, 
but perhaps equally important. 

What are some provocative and counter- intuitive findings that could 
shed light on causal priorities of social factors? An outstanding recent 
example is the shower of studies showing that low SES or minority child- 
ren do not have lower self-concepts than middle-class or majority- group 
children. This finding has been observed often enough so It Is unlikely 
to be overthrown. It was une^qpected because workers assumed that the 
self-view of minority-group children would reflect the view that the 
society at large held for their minority group. Thus adult blacks enjoy 
relatively low prestige In the larger society and so it was assumed that 
they— and black children— would hold low views of themselves as individuals. 
This picture now seems quite false, at least as far as children are con- 
cerned. 

Another exception, less widely known, is the finding by one of the 
authors (Entwisle, 1968) that inner-city (disadvantaged) first-grade child- 
ren of average IQ have developed linguistically to a point attained by 
suburban first graders in the gifted IQ range. I^ils finding has not been 
independently replicated and so is not as firm as the finding of high 
self-esteem for minority- group children, but it holds up over more than 
one measure in the same study and is consistent between black and white 
sub-samples within the study. These two counter- intuitive findlngs-- 
high self-esteem of blacks and advanced linguistic development of black 
first graders- -underline both the importance of events early in life and 
the need for social context to be preserved. 

Rosenberg and Simmons' (1971) work is especially informative with 
respect to the impact of social context. In a large cross- sectional study 
of children at third-grade and higher, they note that minority-group 
children in certain contexts have hi^er self-esteem. The overall picture 
for minority- group children is for higher self-esteem because most, of 
them are in consonant social contexts which protect self-esteem. The 
black child in an all-black neighborhood and all- black school compares 
himself with those around him. He does not use children or families with 
which he has no contact as reference groups. For this reason, if his 
parents are separated or divorced or if his actual father is unknown, his 
family status is not a source of embarassment or sorrow because many of 
his confreres have the same problem. A black child from a broken family 



vrho attends an integrated school, however, rates himself in comparison 
with his classmates whose families are intact. In this atmosphere a 
broken family produces social stigmata. 

The impact of other facets of school life that might also be thought 
to lower self-esteem is cushioned by self-reference to protective groups. 
The low-IQ black child, for example, reports that his parents believe that 
being smart is not very important to getting higher grades. Or, for 
another example, the black child with very dark skin rates color as less 
iiqportant in assessing physical attractiveness than the child with li^t 
skin color. 

The important outcome of Rosenberg and Simmons' work for the present 
research is that self-esteem, which can be termed a "global set of self- 
expectations," is highly dependent on context. Rosenberg and Simmons 
could not investigate causal priorities directly because their data are 
cross- sectional, but their data (retrospective reports of racial taunts) 
liQplicate the kinds of interpersonal events that could lead to lower 
self-esteem in minority-group children. Certainly racial taunts are more 
likely in a mixed than in a segregated context. 

Social context and its ioDgpact on expectations has also been pointed 
up in small-scale esgperimental work. The study of interventions to raise 
expectations of blick children in small mixed racial groups (see work by 
Cohen and her associates) when balanced against acme small experimental 
studies of our own (Entwisle & Webster, 197**a, 197**b) point even more 
strongly to contextual effects and their inqoortance. In brief, in mixed 
racial work-groups of children brought togsther for experiments, Cohen 
found that black children held low expectations for their own success euid 
that these feelings were shared by white work-mates. Blacks who up to 
the time of the experiment were resident in all-black contexts did poorly 
in the mixed-race esgperimental context even when much preparation before 
the experiments guaranteed the superior competence of the blacks. Blacks 
were given intensive crash-course training on assembling radios or the 
other tasks to be carried out with whites in later experiments, while 
whites had no such preparation. Cohen's e:qc>eriments document the low 
expectations held by \hite children for black children and that these 
low expectations impair blacks' performance. 

Guided by the importance of social context in both the large-scale 
Baltimore study of Rosenberg and Simmons and the small-scale e:gperimental 
work, the present research focuses on children within two contexts— one 
all-white and one mixed racially, 6o^ black. (Later work will examine 
all-black contexts.) This research can examine more events, and more 
specific events, than that of Rosenberg and Simmons and also, of course, 
looks at effects over time. If a child is asked to recall racial taunts, 
as In the Rosenberg and Simmons study, such a report has weaknesses too 
well-known to be enumerated here. Without being at all critical of the 
Robenberg and Simmons work, one can note that self-esteem covers meuiy 
aspects of the self. To understand its ups and downs or the underlying 
factors which conqprlse it, considerable precision Is required. The causes 
of self-esteem are no doubt manifold and subtle, and a single overall 
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measure Is probably Inadequate for explaining its causes. The present 
research attempts to look at the course of development of the academic 
self- image in particular areas, how performance feedback shapes that 
self-image, and how feedback from particular persons operates on the 
self-image. The social context in the persons of particular actors 
shapes the image, subject to overall constraints in the social milieu. 
Specifically, peer feedback in €tn integrated environment may have effects 
different from such feedback in a segregated environment. From a 
methodological standpoint it would be fruitless to try to guage the 
self-image with either the localized or general context erased. Up to 
this time studies have not combined both social context and a detailed 
dynamic feedback mechanism. 



Description of Procedures 

Three cohorts of children have been followed from the time they 
began first grade. Two cohorts are in a white middle-class suburban 
school. One (S-1) began first grade in 1971} the other (S-2) began first 
grade in 1972. A third cohort (L-1) in a mixed- race (60^ black) urban 
lower-class school began first grade in 1972. Insofar as possible, 
similcu: data and information were obtained for all three cohorts. 

The time chart below shows the sequence and timing of several of 
the repeated measures over a 2-year time span. Only cohort S-1 has been 
followed for the entire two-year period. Cohorts S-2 and L-1 have been 
followed on the same plan as S-1 but for only one year. Ttie measures 
identified on the time chart will now be described. 



The Children's Expectation Measure 

To provide a measure of the child's expectations for his own school 
performance, children were asked to "guess what your next report card will 
look like. Guess what you will get in reading... in arithmetic. . .and in 
conduct." How elaborate the interviewing procedure was which acconqpanied 
this "guessing" depended on whether the child was new to the study or had 
been interviewed previously. 

For the initial measuring of expectations, a large plastic brightly- 
colored sheet (approximately 2' x 3') was prepared (see Fig. 2.2) with 
titles of school subjects (Arithmetic, Reading, Conduct) and squares for 
entering marks in it like a report card. This sheet was spread out on a 
table or sometimes on the floor. Next to the sheet were a number of piles 
of cardboard squares with large numerals (1, 2, 3, k) inked on them. 

Children, interviewed individuallly outside their classroom or in a 
separate room nearby, were told thaL "we are going to play a game-- 
guessing what you will get on your report card." Before "playing the 
Kame," the child was asked if he knew wnat a report card was, what the 
numerals meant, and what "reading", "arithmetic", and "conduct" meant. 
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Enough discussion then ensued so the interviewer felt reasonably con- 
fident the child understood what school report cards signified and how 
marks were coded. The child was then asked to pick a number from the 
pile of numerals and put it in a square next to "Reading" to "guess what 
you will get in reading. " He was similarly asked to pick numerals to 
represent his guesses for arithmetic and conduct.^ Initial interviews 
for measuring expectations in reading, arithmetic and conduct were held 
slightly before the child received his first report card in first grade. 
Report cards are issued 3 times in grade one, midyear, year end and 
half way between midyear and year end. Only midyear and year end report 
card information is used. In second grade report cards are issued four 
times— roughly corresponding to the end of each of four quarters of the 
school year. 

As the child made his guesses the interviewer unobtrusively recorded 
check marks on a small 3" X 3" card. The cards were kept out of sight 
and it is doubtful if any of the children were aware their guesses were 
being recorded. 

This same guessing procedure was carried out twice yearly during 
each school year. In first grade it occurred once before the child re- 
ceived his first report card (November for L-1, December-January for S-1 
and S-2), and again before the end of the school year in lare May or early 
June. In second grade it was carried out Just before the end of the first 
semester and again near the end of the school year. 

As will become clearer from the analysis of the data, the expectation 
measure appears to have a fair degree of validity and reliability indicated 
by re- interviews, meaningful relationships that emerge between it and 
other variables, and a substantial degree of test-retest agreement when 
guesses for the same child are matched between one session and the next. 
A special reliability check run on a small saiqple of first graders with 
a short time span (one week between test and retest) indicates a high 
degree of concordance between e3Q>ectations elicited on the two occasions (.76). 



Sociometric Data 

TWO methods were used to procure children's sociometric standings 
within their own classroom. The first used a multiple-choosing procedure 
which forced a child to pick 6 children on two occasions (12 choices 
altogether) in such a way that choices would be widely distributed. The 
procedure went as follows. The class was randomly divided into three 
"teams," designated white, red, or blue. Then every child picked two 
persons from each team. The class was then redivided randomly into thirds. 
(One third of the former white team remained white but the other thirds 
were, respectively, now designated red or blue, with similar re-partitionings 




At a later time a separate interview was conducted with each child 
by a different interviewer. These interviews verified the child's under- 
standing of marks and marking systems without reference to the guessing 
task. The independent verification is discussed in connection with results. 
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for the red and blue teams of the first round.) Again everyone picked two 
persons from each team. The results were tallied as the number of times 
each person was chosen and were later connected to the proportion of choices 
each child received. The scores were then normalized to be between zero 
and one. Tlhe normalization re-weighted choices to fall on a similar scale 
despite differing numbers of children per classroom. The multiple-choosing 
procedure Just described was used only with S-1 in first grade. 

The multiple-choosing procedure was too complex to be used with 
lower-class first graders. Another sociometric procedure was therefore 
developed and used with all classrooms starting in 1972 (S-1 in second grade, 
S-2 in first grade, and L-1 in first grade). The new procedure involved 
having a boy and a girl in each class designated as captains. Captains 
then took turns choosing their classmates to be members of their team in 
order to "play a reading game." Children to be captains were nominated 
by teachers. Teachers were asked to choose the boy and the girl in their 
class who were most proficient in reading and socially mature. 

Ihe experimenter, on entering the classrooms, e^^lained to the class 
that "we are going to play a game" and called the captains to the front 
of the class without explaining how the captains had been chosen. As the 
captains took turns choosing children to be on their teams, a research 
assistant noted the order in which children were chosen. Children wore 
large numbers (like numbers worn by football players) so they could be 
easily identified. This procedure resulted in two separate rankings, one 
made for each captain's team, with the captain of each team being assigned 
the highest rank. In general the teams chosen were homogeneous by sex, 
girl captains choosing girls and boy captains choosing boys. The ranks 
by class were converted into scores between zero and one, so as to be 
comparable to sociometric scores obtained by the multiple* choosing pro- 
cedure and also, of course, to normalize with respect to class size. 

Socicnietric rating scores between zero and one were assigned in such 
a way that the rating scores were equally spaced along the continuum be- 
tween zero emd one according to the rank order as obtained from the social 
choice ranking (i.e. order of being chosen or order in terms of propor- 
tion of choices received) . Children tied on original rank order remained 
tied in terms of the resulting rating scale. Ties regularly occur in the 
"group-captain" procedure due to the parallel choice structure involved. 
The sociometric scale that results from the above procedure is a measure 
of the children's within room sociometric standing. 

The sociometric ranking measurement was carried out once per school 
year per cohort at some time in the second semester. When the ranking 
was repeated with one lower-class group (20 children) on two occasions 
one week apart, the correlation between rankings was .73* The way the 
choosing occurs and the high correspondence one observes for the first 
half dozen children chosen for each team both suggest that there is a 
fairly clear, stable ordering for the top half of the class. For the 
bottom half there may not be much discrimination among individuals, but 
rather a generalized perception that the group as a whole is less able. 
The analogue of serial order effects appears to occur for children high 
on the list but not low. There docs not seem to be a consensus as to 
who is "worst", "next worst", etc. although there is consensus as to who 
is "best", "next best" and so on down to about tenth in the class. 
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Data from School Records ; Sex, Race, IQ, Absences, Standardized Achieve- 
ment Test Scores, Teachers' Marks. 

Sex and Race . These designators are self-explanatory except that all 
minority-groups except blacks (orientals and others) were classified as 
white. 

Ift . In both schools, IQ tests are routinely given by school per- 
sonnel with scores recorded in pupils' folders. For first graders in 
both schools IQ's are obtained using the Prloary Mental Ability test 
early in the first-grade year. 

Absences. Absences were tallied for the year in both schools on 
the report card Issued at the end of first grade. 

Standardized Achievement Scores . These will be available as students 
progress in school. The middle-class school does not have standardized 
achievement tests until the child is in third grade. We have none in 
data so far but will pick these up as they become available. In the 
lower-class school the practice of giving achievement tests has changed 
starting with the cohort under study. No achievement tests will be 
given until third grade. 

Teachers' Marks . In both schools teachers assign marks in a wide 
variety of subjects (See Figures 2.3 and 2.U for replicas of report 
cards in use in the middle-class and lower-class schools, respectively.). 
The reader should note carefully that the teacher's basis for assigning 
marks, according to the report card definition, is quite different in 
the two schools. In the middle-class school the teacher atteiqpts to 
mark the child in relation to his own ability. In the lower-class 
school the teacher atteqpts to mark the child in relation to others of 
his grade level. Since both bases of assessment leave considerable 
room for interpretation, we queried teachers to get their definition 
of marking practices. Some written responses of teachers in the two 
schools, reproduced below, suggest that teachers do indeed attempt to 
operationalize the marking standards the report cards define. Samples 
of teachers' responses in each school are given verbatum below. 
Lower-class school: 

Arithmetic. There are a list of math skills that each first grade 
must be taught. Ihese skills are taught in sequential order... 
Diagnostic tests are usually given to evaluate how well the child 
has mastered a particular area. . . .Those who mastered the skill 
received "good" (3). Those who even with additional reinforcement 
only got half on the test got 2 (fair) on their cards. Children 
who never mastered the skill got 1 (poor). 

N.B. There seems to be confusion on this tecusher's part about "1" being 
the highest mark— it was uo defined on the questionnaire sheet containing 
tlie unHwer above and "1" alyo is defined as high on the face of the report 
card. Middle-class school: 
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Figure 2.3 
Middle-Class School Report Card 
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Figure 2,k (Continued) 
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When I assign grades for any subject for the first marking term 
of the year I. . . ook at the child in tenns of his own ability. 
Our nongraded system enables us to mark a child in .light of 
his achievements in regards of his capability instead of the 
traditional comparison to others who may have different 
capabilities. . . . 

If outstanding progress or 1 is to be considered then work 
must be: 

a. consistent with ability 

b. independent, self- directed, enthusiastic 

If satisfactory progress or 2 is to be considered then: 

a! child's work should be consistently accurate in subject 

b. child lacks enthusiasm 

c. child lacks self-direction in independent activities 

d. child is atiV. progressing 

If moderate progress or 3 is to be considered then: 

a. work is inconsistently accurate in subject matter 

b. child shows an erratic pattern of effort 

c. child lacks enthusiasm 

d. child shows little self-direction 

e. child is still showing progression 

If not satisifactory progress or k is to be considered then: 

a. child's work is inconsistent 

b. child shows no independence 

c. child shows no self-direction 

d. child shows no enthusiasm 

e. no progress is shown 

I determine conduct marks by considering points: 

1. consideration 

2. interaction among peers 

3. following adult requests 



Self-Esteem 

Self-esteem was measured on a scale especially developed for use with 
young children (Dlckstein, 1972) . Separate scales exist for boys and 
girls. Each scale contains three factors. These factor scores have been 
kept separate for the analysis, thus yielding three different indicators 
of self-esteem for each sex. The items (abbreviated) are shown for each 
factor, separately by sex, below: 



I 

polite 

obeying rules 

cooperative 

kind 

helpful 

honest 

good student 



Boys 
II 

sports 
strong 

playing ball 
many friends 
gymnastics 
running 
right weight 
able to look 
after others 



III 
arithmetic 
good student 
learns new things 
quickly 
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Girls 



can look after 

others 
can take care 

of herself 
polite 
good student 
learns new things 

quickly 
writing 
arithmetic 
honest 



II 

sports 

strong 

gymnastics 

playing ball 

running 

dancing 



III 

good-looking 
many fiends 
right weight 
good student 
polite 



Parents' Expectations and Questionnaire 

Parents in the middle- class school were interviewed in many instances, 
the occasion being their appearance to visit their child's classroom 
during American Education Week (late October). Those not available were 
sent questionnaires by mail. 

Parents in the lower-class school were also interviewed at school 
if they came for activities in connection with American Education Week. 
A much lower percentage attended however and the difficulties In securing 
the needed information from interviews in school led us not to mail 
questionnaires to these parents, but rather to send out interviewers to 
the homes to secure the information. Children carried home a note a day 
or two in advance of the interviewer's call notifying the parent tnat an 
interviewer would be coming, and explaining that this was a routine 
research request, not an indication that their child was in difficulty 
at school. Black adult female Interviewers were despatched to homes of 
black children, and white adult female interviewers saw parents of white 
children. Refusal rates and other data are given with results (Chapters 
3-6). 

The data procured from parents was, insofar as possible, identical 
for the two schools. The parents were asked to "guess" what their child 
"would get on his next report card" and the interviewers recorded guesses 
on sheets of paper with replicas of part of the report card appearing 
upon them. 



Analysis 



All data were coded and ..unched on cards. Data reduction was 
accomplished mainly through use of standard DATATEXT programs. Further 
analyses and tests were calculated as needed. Selected results and 
the accompanying analyses are presented in Chapters 3, k, 5, and 6, 
which follow. 
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CHAPTER 3* 
MIDDLE-CLASS SCHOOL, COHORT 1 
( STARTING FIRST GRADE IN SEPT. 1971) 



Just north of the city line in Baltimore County, Maryland is a well- 
established middle-class neighborhood served by an elementary school. 
There are kindergartens in the school, two classes in the morning and 
two in the afternoon. Hhere are three or four first grade classrooms, 
the number depending oi. yearly enrollment. 



Tabular Summary 



The first-graders who started school in September, 1971* at this middle- 
class school have been followed so far through two school years. The same 
kind of information was procured in both years. Data for the two years 
are summarized in Table 3*1 • 

Table 3.1 shows that in every area (reading, arithmetic and conduct) 
children expect on the average a (numerical mean values range from 1.63 
to 1.68) at midyear, and their expectations get higher at year end, 
especially in reading and conduct. Parents have lower esgpectations, 
averaging Just slightly above a B (1.83 to 1.96). Pai^ents' expectations 
in arithmetic are lower (2.03) the second year but reading and conduct stay 
much the same (1.87 and 1.90). Children's expectations consistently showed 
a decline between year end and the middle of the second year but a slight 
increase from the middle to the end of the second grade. By the end of 
the second year children ' s expectations averaged almost exactly what they 
did at the time they were first observed. (See also Table ^.1, columns 
labelled "Cohort 1" for a summary of means and standard deviations.) 

The difference between the average expectations of parents and children 
does not give a very accurate notion of the difference between them, for 
the children's distributions are markedly skewed. For example, in reading 
60 per cent of the children expect to receive the highest grade, whereas 
only l6 per cent of parents are that optimistic. At the other extreme, 6 
children (7.5*J&) expect to get the poorest grade whereas only one parent 
(1.2^) forecasts a failure. 

The variability of children's expectations is noticeably larger than 
parents' for every area at the middle of first grade but by the second grade, 
the variability of parents' expectations Ircreased from first to second 



*The reader may wish to read Chapter 5 first which presents combined 
results for Cohorl^s 1 and 2 in the middle-class school, and then return to 
Chapters 3 and h to pick up points not covered in Chapter 3* 



Table 3.1 

Means, Standard Deviations for Cohort 1 
Middle Class School, 1971-72 and 1972-73 





Mean IQ = ll^.Mf 
S.D. = 10.13 


Mean IQ = 104.1 
S.D. - 11.47 






1971-72 




1972-73 




N 


Mean S.D. 


N 


Mean S.D. 


Parents Expectations— Midyear 




(Time 1) 




(Time 3) 


Reading 

Arithmetic 

Conduct 


84 
01. 
84 

79 


1.96 0.57 
1.83 0.60 
1.89 0.55 


76 
75 
76 


1.87 0.72 
2.03 0.77 
1.90 0.69 


Child's Expectations— Midyear 




(Time 1) 




(Time 3) 


Reading 

Arithmetic 

Conduct 


90 
90 

90 


1.63 0.91 
1.66 0.90 
1.68 0.90 


102 
102 
102 


1.69 0.70 
1.77 0.90 
1.80 0.83 


Child's Expectations— Year End 




(Time 2) 




(Time 4) 


Reading 

Arithmetic 

Conduct 


90 
90 
90 


1.42 0.60 
1.64 0.81 
1.49 0.72 


102 
102 
102 


1.63 0.63 
1.72 0.72 
1.68 0.63 


Child's Marks— Midyear 
Reading 
Arithmetic 
Conduct 


85 

85 
85 


(Time 1) 
1.77 0.65 
1.82 0.49 
1.75 0.75 


102 
103 


(Time 3) 
1.91 0.80 

1.81 0.72 


Child's Marks— Year End 




(Time 2) 




(Time 4) 


Reading 

Arithmetic 

Conduct 


86 
86 
86 


1.69 0.76 
1.65 0.59 
1.88 0.83 


103 
103 
103 


1.60 0.65 
1.94 0.64 
1.73 0.73 


Mark Discrepance (Midyear Mftrk minus Year End Mark) 

(Tl - T2) 

Reading 78 0.10 0.70 
Arithmetic 78 O.19 O.56 
Conduct 78 -O.oi 0.63 







Expectation Discrepance (Midyear Expectation minus Year End Expectation) 

(Tl-T2^ 

Reading 85 0.24 0.88 

Arithmetic 85 0.00 l.l4 

Conduct 85 0.21 1.03 

Mark-Expectation Discrepance— Midyear (Mark minus Expectation) 

(Time 1) (Time 3) 

Reading 82 O.18 1.07 96 I.9I 0.80 

Arithmetic 82 0.I8 0.97 97 1.84 0.73 

Conduct 82 0.10 1.06 97 1.79 0.84 

Mark-Expectation Discrepance— Year End (Mark minus Expectation) 

(Time 2) 

Reading 84 0.27 O.87 

Arithmetic 84 0.02 0.88 

Conduct 84 0.37 0.92 



Table 3*1 (continued) 
I.Q. Correlations 
PMA = Primary Mental Ability 

SFTAA - Short Form Test of Academic Aptitude (from the California 
Test of Mental Maturity) 

1971-72 1971-72 



N r H r 

Midyear I.Q. (FMA) -Mark Correlations 

(Time 1) (Time 3) 

Reading 82 .221^* 79 .279* 

Arithmetic 82 .022 80 .37W 

Conduct 82 .109 80 ,01k 

Year End I.Q. (FMA) -Mark Correlations 

(Time 2) (Time If) 

Reading 82 .099 80 .145 

Arithmetic 82 .008 80 .196* 

Conduct 82 .0^3 80 -.001 

Year End IQ( SFTAA) —Mark Correlations 



Reading 


102 


.131 


Arithmetic 


102 


.201* 


Conduct 


102 


-.010 


Correlation of Tl I.Q. (IMA) with T2 I.Q. (SFTAA) 


80 


.655 



♦ = p < .05 

♦* = p p .01 



grade while the variability of children's e39ectations declined frcM the 
middle of first grade to the end of second grade. 

The teacher's rating of the child, i.e., marks, on the average, falls 
in between what parents escpect and what children es^ect. In reading the 
average mark is 1.77 compared to 1.63 escpected by children and 1.96 expected 
by par'^nts. In arithmetic parents expectations and marks almost coincide 
(1.83 vs. 1.82). In conduct the spread is like that for reading. 

The range of marks given by teachers is roughly coiqparable to 
parents' esqpected range but smaller than the child's expected range. The 
teacher gave no It's and gave Q% to 90^ A' 8 and B's overall. Their marking 
standards in general then, are relatively easy. However, as will be pointed 
out below, the relative number of A's and B's is very different from one 
area to the next. About equal numbers of A's and B's are given in conduct 
and about 3 times as many B's as A's are given in arithmetic. Reading 
falls between, with about a 3:2 split favoring B's. Teachers then, vary 
noticeably in their marking practices from one subject to the next, 
despite the similarity in the average marks assigned. 

The assignment of marks must be interpreted in light of the fact (see 
Chapter 2) that in this school children are not marked with respect to one 
another, but are supposed to be marked in terms of their own ability. If 
a child is Judged to have "high" ability but does not, in the opinion of 
the teacher, perform up to his potential, then his grade is low. 
Theoretically, to take the other side of the coin, if he performs much 
better than the teacher expects, she should assign a high grade. Fran 
Inspection of the actual mark distributions, children on the average 
appear to be slightly better in all three areas thcui the teachers expect. 



Discrepances Between Expectation and First Report Card 



Reading 

In general, for children the first mark in reading must be surprising— 
in well over one- third of the cases (3^ out of 82) an unpleasant surprise. 
In about one-quarter of the cases (20 out of 82) the child does better 
than he expiects. All children who e^^ect the worst (6) do better than 
they hoped. Perhaps the most coniprehensive c(»mnent is that 66 expect and 
get A's or B's, but of these, 39 (39^) represent reversals— those es^ectlng 
B's get A's or vice versa. There is no significant matching between 
children's e:q>ectations and marks at this early stage. Thirty matches are 
expected by chance, 28 actually occur. There is some indication that 
expectations In reading are higher than marks rather than lower in the 
cases of disagreement, but the trend is not strong enough to attain 
significance. 

Arithmetic 

In arithmetic there is likewise no significant agreement between 
children's e3q>ectatlons and the marks they receive. Matches would be 



Table 3.2 



Reading Expectation, 

Time 1 ^ 



Total 
Percent 



Reading Mark, Time 1 
X 2 3 Total 

7 ^9 



15 
12 



27 
12 



28 U5 9 
3»t.l 5»*.9 11.0 



2U 

3 

6 
82 



Percent 
59.8 

29.3 

3.7 

7.3 



100.0 
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Table 3*3 



Arithmetic Mark, Time 1 



1 



2 3 Total Percent 



1 13 35 



3 - 5 



50 61.0 



o o 1^ 15 l8.3 

Arithmetic 2 2 13 -^^ 

E3Q>ectation, 

Time 1 3 3 10 1 1^ 17.1 

U 3 3 3.7 

Total 18 61 3 82 

Percent 22.0 7h,k 3.7 ^^'^ 



Table Z.k 



Conduct 
Expectation, 
Time 1 



1 
2 



Conduct Mark, Time 1 
1 2 Z k 

23 1^ 7 1 



10 



13 



Total 



35 3h 



Total Percent 



12 



26 
6 

5 
82 



5h.9 
31.7 
7.3 
6.1 



Percent 



h2.7 hi, 3 lh,6 1.2 



100.0 



expected 28^ of the time by chance, 33^ actually occur. Children were 
pptimistic about arithmetic to start with for 6l^ expected an A. The 
outcone must have been disappointing, however^ because the teachers are 
noticeably less generous with A's (22^ of graces are A's in arithmetic). 
Thirty- seven children (U^)—or almost half the toted— do not do as well 
as they expected, and the pendency to do worse rather than better than 
expected is significant (Xj " 6.^6, p < .0^).^ 



Conduct 

Teeushers apparently are most lenient in their grading of conduct- 
giving about Q3% A's and B's and dividing marks about equally between 
the two. It is about twice as harl to get an A in arithmetic as in 
conduct, with reading standing in between. Again the amount of matching 
between children's expectations and their marks is not significant. Al- 
though there is some tendency for es^ec tat ions to exceed marks when 
disagreement uccurs, the tendency for over- compared to under-estlmation 
is not statistically significant. 



Discrepance Between Expectations 
and the Report Card at the End of First Grade 



Reading 

The year end report card must also have been suxprising to children 
(Table 3.^). Less than half {h3%) got the mark in reading they e^qpected 
and this proportion is within the reuige of chance. Of those 35% who 
received a mrk different from what they expected 72% did not do as well 
as they hoped. Of the remainder almost all {2k%) were children who 
expected a B bub got an A. The tendency |or children to expect a higher 
mark than they received is significant (Xi « 8.69, p < .01). 

If anything, children's e^ectations for A's and B's are more 
polarized on the year end report card than on the first— 98^ now expect 
a B or an A cooopared to 89^ earlier. This means that even though more A's 
are actually awarded by the teacher {k&ji, vs. 3^^ awarded earlier) children's 
expectations have increased enough in the interim so that about the same 
degree of negative feedbeick ensues. 



values calculated for the comparison of cases above and below the 
main diagonal have not, in general, been corrected for continuity within 
this report. Since the ejected values are quite large (20 or more) such 
a correction would have a negligible effect. (See G. Snedecor and W. 
Cochran, Statistical Methods ^ Sixth Edition, pp. 209-213 for a discussion 
of the relationship of the correction of this type to the usual 
continuity correction for the normal approximation to the binomial.). 
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Reading Mark, 
Time 2 



Table 3.3 

Reading Expectation, Time 2 

1 2 3 k Total Percent 

1 28 11 1 1*0 47.6 

2 20 10 1 31 36.9 
3^8 12 lit. 3 
hi 1 1.2 

Total 52 30 1 1 8U 

Percent 6I.9 35.7 1.2 1.2 100.0 



Table 3*6 

Arithmetic Expectation, Time 2 
123k Total Percent 



20 



13 



3h 



U0.5 



Arithmetic 
Mark, Time 2 



20 18 



^♦5 



53.6 



6.0 



Total 1+3 



32 



8U 



Percent 51.2 38. 1 7.1 3.6 



100.0 
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Conduct Mark) 
Time 2 



Table 3.7 

Conduct Expectation, Time 2 

1 2 3 4 Total Percent 

1 23 5 2 30 35.7 

2 23 10 if 37 ^^.0 

3 k 6 2 1 13 15.5 
If 1 2 1 U U.6 

Total 51 23 9 1 Bh 

Percent 6O.7 27.^* 10.7 1.2 100.0 



Arithmetic 



The narks given in arithmetic are conaiderahly higher on the second 
report card— over ^ A's vs. 22?t A's earlier. The condslned number of B's 
and A's is about the same on the two occasions, however. There is 
noticeably less pessimism on the part of the children at the end of the 
year conpared to midyear for 11% fear the lower two grades whereas earlier 
21% harbored these fears. There has also been a reduction of extreme 
optimism?-- 51% now look for A's conpared to 6l% earlier. For arithmetic 
the net effect is to produce a within-chance amount of correspondence 
between marks and expectations and departures about equally in the overly 
optimistic and overly pessimistic directions. The correspondence between 
expectations and marks in arithmetic on the second report card, in fact, 
looks almost random. 



Conduct 



For conduct there is likewise no significant matching between children's 
expectations and marks received (U2% observed vs. 35% expected). Of more 
interest is the marked asymmetry in terms of marks failing to attain 
expectation level— UU% of the children get a lower mark than they expect 
which iigplies that when esgpectations and marks disagree, there is a 
highly significant tendency for marks not to equal expectations (x =12.76, 
p < .01). (For the first report card matching was within chance expectancy 
and there was a slight but not significant tendency for marks to be less 
than ea^ected.) Teachers are marking somewhat harder in conduct on the 
second report card than on the first— 7% less A's and 5% more C's and D's. 

In summary at both midyear and year end children's expectations in all 
three areas show only a chance level of matching with the marks they 
receive . There is a consistent pattern (in all areas at both midyear and 
year end) of children's ea^ectations exceeding their marks if the two did 
not match but this only attains conventional significeuice levels for mid- 
year arithmetic, and year end reading and conduct. 



Discrepances Between Expectations 
and Report Cards Over the Second Grade 



Midyear, Second Grade 



By the middle of second grade the child has received two second-grade 
report cards. Only the latter one (given at midyear) was examined. 



Reading 

As was true throughout first grade, in reading there is no significant 
matching between marks and children's expectation levels. As was true at 
midyear of first grade (but not at the end of first grade) there is no 
significant tendency toward over-optimism or over-pessimism in children 
whose hopes and marks differ in reading. 
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Table 3.6 



Reading tfork, 
Time 3 



Reading Expectation, Time 3 
12 3^ Total 



12 
18 

7 
2 



17 
22 

7 
1 



Total 39 ^7 



1 
6 
1 



8 



1 
1 



31 
U7 
15 
3 

96 



Percent 
32.3 
1+9.0 
15.6 
3.1 



Percent 40.6 1+9.0 8.3 2.1 



100.0 



Arithmetic 
Mark, Time 3 



2 
3 
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Table 3.9 

Arithmetic Escpectation, Time 3 
1231+ Total 



9 
30 
8 



25 



Total '+7 3p 



6 
1 



3 
5 



8 



Percent hb.'j 36.I 7.2 8.2 



1^3 



3-11 



11+ 

61+ 
18 
1 

97 



Percent 
1I+.I+ 
66.0 
18.6 
1.0 



100.0 



t 



Conduct Mark) 
Time 3 



Table 3.10 

Conduct Expectation, Time 3 

1234 Total Percent 

1 18 13 2 33 3^.0 

2 19 25 7 2 53 5^.6 
3^132 10 10.3 
U 1 1 1.0 

Total 1*1 39 13 h 97 

Percent h2.3 1^0.2 13,h k,l 100.0 



o 
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Arlthmetlo 



In arithmetio likevise there; is no significant matohlng at the middle 
of grade 2, continuing the state of « affairs noted throughout grade 1. 
There is a noticeable tendency for narks to be less than expected, however^ 
for kki, of the children receive a mark lower than they had hoped for. 
This may be a consequence of a noticeably increased severity in marking. 
Whereas 9k% of children had received B's and A's in arithmetic at the end 
of first grade, and kl% of these were A's. by the middle of second grade 
80^ are receiving B's and A's with only 1^ of these A's. A large number 
of children (31^^) expect an A but receive a B. This is the chief factor 
responsible for ^signif leant asymmetry in Table 3>9 whereby e^rpectations 
exceed markb (x = 9.29, P < .01). (This kind of asymmetry prevailed at 
midyear of grade 1 but not at the end of grade 1.) 



Conduct 

In conduct there is significant above chance matching {kTfa receive 
the mark they expect, z = 2.00, p < .05) and this is in contrast to lack 
of matching noted at both mid- and end-of-year in grade 1. Whereas at the 
end of grade one there was a significant tendency for the non-matching 
students to do worse than they expected, at the middle of second grade 
both positive and negative discrepances occur to about the same extent. 
Teachers give a few more B's and a few less C's and D's at the middle 
of second grade compared to the end of first grade. 



End of Second Grade 
Reading 

At the end of second grade for the first time there is a highly 
significant match between expectations and marks in reading (z = 2.86, 
p < .01). The correspondence comprises of the cases. Children's 
es^ectations are a little higher than they were at the middle of second 
grade, but the average mark awarded has also risen. About 92^ of the 
children get A's or B's vs. 81^ earlier. 

Fewer of the children expect A's than was true at the end of first 
grade (U6^6 compared to 62%), although the overall expectation for A's and 
B's is practically universal at both times (98^ at the end of first grade 
and 9k% at the end of second grade). 



Arithmetic 

There is likewise a significant match in marks expected and received 
in arithmetic at the end of second grade, for 37% receive exactly what 
they expect (z =23.72, p < .01). Of the remainder a significant number 
(31^ vs. 12%) (x, = 8.U0, p < .01) do worse than they had hoped. Thus 
compared to the middle of second grade, children's ability to forecast 
has improved (there was no significant matching then or earlier in first 
grade). The errors in forecasting are still occurring on the optimistic 
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Table 3.11 



Reading Mark, 
Time h 



2 
3 



Reading Expectation, Time h 
1 2 3 k Total Percent 



32 



13 



18 
2h 
6 



Total 45 48 



50 
41 
8 
99 



50.5 
Ui.U 

8.1 



Percent 45.5 48.5 5.1 1.0 



100.0 



Table 3.12 

Arithmetic Expectation, Time 4 
1234 Total Percent 



Arithmetic 
Mark, Time 4 



16 



24 



34 



6 



23 
61 
14 



23.2 
61.6 
14.1 



Total 4l 47 



99 
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1.0 



Percent 4l.4 47.5 9.1 2.0 



100.0 



Conduct Mu.k, 
Time k 



Table 3.'-3 

Conduct Expectation, Time h 

123 Total Percent 

1 24 17 1 k2 k2.k 

2 ll* 25 5 hk Uk,k 
3272 11 11.1 
hi 1 2 2.0 

Total hi h9 9 99 

Percent hl.h i*9.5 9.1 100.0 
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side— odds of about 10 to k for optimism, continuing a trend noted in the 
middle of second grade and the middle of first grade but not at the end 
of first grade. 

The "over-optimism" may be more a function of different marking 
standards being applied than of changes in expectations, however. That 
is, children's forecasts may be more consistent than teachers' marking 
(or ultimately even children's perfonaance) . At the end of first grade 
only 6^ of children receive less than a B in arithmetic whereas at the 
two subsequent semester points 20^ and 1^ receive less than a B. These 
changes in grading standards over time approximate the picture for reading. 
There about l6^ get less than a B at the end of first grade, with 19^ 
and 8^ showing less than a B at the two second grade points. 



Conduct 

In conduct by the end of second grade there is a significant corre- 
spondence— 52^ matches— between expectations and marks (z = 2.23, p < .05). 
Of those whose marks do not match, over-optimists and over-pessimists 
are about equal in number (25% and 23% respectively). Teachers give 9ji 
more A's at the end of second grade than in the middle. The overall 
division between hlgli (A and B) and low (C and D) marks is fairly constant 
over the entire 2-year period. There are shifts, as noted, between the 
relative: proportions of A's and B's. 

In sunanary the second grade displays a marked change in the degree 
of matching between marks and children's expectations. Throughout grade 
one matching occurred at only chance levels but by the end of grade two 
all the marking areas displayed significantly above chance levels of 
matching . This indicates some fundamental change has transpired over 
the course of the second year. Some causal relationship has begun to 
display its "effects, mils suggests the end of grade one and all of 
grade two should provide fertile ground for beginning to sort out the 
caused relations linking the various variables of Interest. Hhe fact that 
the relationship developed in conduct before it did in the substantive 
academic areas may be a function of the child's differential monitoring 
and control capabilities regarding ccnduct. 

Over the two year span only in arithmetic do children display a 
continual tendency to over-optimism regarding their mark . This occurs at 
3 out of the k observation periods— the exertion being time 2. Reading 
and conduct display no over-optimism in 3 of the k observation periods — 
the exception again being time 2. Changes in the mark distributions 
assigned (whether it be due to changes in the dlfflciaty of subject matter, 
changes in teacher's marking severity, etc.) influence over - or under - 
optimism as much as do expectations themselves, so the degree of over- 
or under-cptimlsm might be e3Q>ected to be reasonably variable (as it is). 
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Hew Do Children's Expectations Change Over First Grade Year?^ 



Reading 

If hope springs eternal in the human breast, it does so also in the 
very young breast for the marginal distribution of children's reading 
expectations at the end of the second semester of grade one is more skewed 
than the simileur distribution earlier in the year. Whereas early in the 
year 12 children look for 3's or k'a, now only 3 children look for such 
low grades. Fouf of those formerly pessimistic 12 now even look for A's. 
In addition the lopsidedness between A's and B's at the end of the first 
semester (about twice as many expect A's as B's) diminishes very little 
despite the teachers' awarding of marks in reverse ratio. The first 
marks awarded include noticeably more B's than A's in about a 3:2 ratio. 
Children thus have unrealistically high eagpectations about A's at the 
start but the awarding of marks appears to do little to change these 
unrealistic ea^pec tat ions, so far at least. The awarding of marks (or 
possibly some other contemporaneous factor) does appear to dissipate the 
worst fears of the young pessimists, however. The exaggeration of the 
skewing of the e;gpectation marginal is not quite ^ough to produce a 
significant asymmetry around the main diagonal (Xj = 3.56, N.S.). 

More than half (60^) of the children hold the same expectations at 
two points in time. This agreement is highly significant (z = 2.77, P < .01). 



Arithmetic 

There is less consistency over time in children's expectations for 
arithmetic, less than half {k7%) showing perfect consistency. This amount 
of consistency is significant above chance levels, however (z s 2.05, 
p < .05). There is a moderate increase in optimism among those {2k%) who 
expected to do tb*; worst (3's and U's). Interestingly ten of the 20 who 
earlier expected a C or a D expect an A at the end of the year. There are 
some marked counter movements as well— 6^ of the children display e^ecta- 
tion declines of 2 or 3 units. Of the large number (59?&) expecting I's 
at the end of the first semester, over half are still looking for I's at 
the end of th^ second semester. As with reading, there is no significant 
asymmetry ( Xj = 0.36, W.S.). 



Conduct 

Perhaps because many good marks in conduct were given, eacpectations 
in Tchis area are generally moving jji a more positive direction. There is 
little pattern in Table 3.l6, however. Children's expectations at the two 
points in time do not match any better than vovld be es^ected by chance 
and there is no significant tendency toward increasing or declining 
e3Q>ectations in those not matching. Of the lU children es^ecting a C or 
a D at midyear, 6 or k3% come to expect an A at year end. 



^Two report cards are issued In the second semester of grade one. 
Only the second report curd~the one Issued at the end of the semester 
(and iicncc also at the end or f.rude one)— is discussed here. 



Table 3.1^ 



Reading 
Expectation, 
Time I 



1 

2 

3 
k 



Reading Expectation, Time 2 
12 3 4 Total Percent 



37 
12 
1 



Total 53 



10 

13 
k 



29 



Percent 62. U 3U.1 2,k 1.2 



k6 
25 
5 



85 



56.5 
29. ^* 
5.9 
8.2 



100.0 
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Arithmetic 
Expectation, 
Time 1 



1 
2 

3 
k 

Total 
rercent 



Table 3.15 

Arithmetic Expectation, Time 2 
1231* Total Percent 



28 
k 
8 
2 



18 
9 
5 



32 



U9.U 37.6 
3-18 



2 
1 

3 
1 



2 
1 
1 



8.2 



U.7 



50 
15 
17 
3 

85 



58.8 
17.6 
20.0 
3.5 



100.0 



Table 3.l6 

Conduct Expectation, Time 2 

12 3^ Total Percent 

1 30 12 3 k3 52.9 

conduct 2 18 5 3 26 30.6 
E}Q)ectatlon, 

Time 1 3 3 3 2 8 9.U 

i» 3 1 1 1 6 7.1 

Total 5^ ,21 9 1 85 

Percent 63.5 2U.7 10.6 1.2 100.0 
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In smmaary . while expectations in the substantive areas of reading 
and arithmetic show significant stability from midyear to year end, conduct 
does not. In all three areas a substantial proportion (about UO^) of those 
expecting the two lowest marks (C's or D's) at midyear come to expect the 
highest mark possible (an A) by year end. The majority of those expecting 
C's or D's at midyear expect at least ati average mark (B) by year end in 
all three areas. Expectations show no evidence of being pexnanently 
depressed if the child's first expectation wati for failure. 



How Do Children's E^gpectations Change from Grade 1 to Grade 2? 



Reading 

Well over half (60^) of the children have the same es^ctation for 
the end of the first semester, grade 2, as they held at the end of first 
grade. This is a highly significant match (z « 2,31, P < .0^). Of those 
who shift, twice as many (27^) shift down as up (13^), but this does not 
attain conventional significance levels (Xi = 3*23) • The decline in the 
number of (^hildren expecting an A (^ vs. 62^) is what is producing the 
observed X . 



Arithmetic 

The same amount of consistency between esgpectations at the end of 
first grade and second grade midyear expectations (39^) for arithmetic 
occurs as was noted for reading. This amount of matching is above chance 
levels {kl%) and is highly significant (z « 3*33> P < ^Ol). There are, 
however, almost equal numbers of children shifting up and down. Also the 
marginals at each time point are almost identical. Roughly twice as many 
children look for C's and D's in arithmetic as in reading, but the over- 
all percentage (10^) is still small. 



Conduct 

There is significant matching in e3Q>ectations for conduct at the end 
of first grade and midyear of second grade (z « 2,h3, p < .0^). The 
teacher's bearing down on conduct grades at the end of grade 1 may have 
led to a decrease in children's optimism about conduct grades near the 
middle of grade 2. Whereas at the end of first grade, about 63^ looked 
for an A, k2% now expect an A. Four more children expect C's than 
formerly, and 2 more expect D's. There is a significant overall downward 
shift in expectations for conduct grades (x^ =° 10.33) over this time 
Interval (12^ move up and 37^ move down). ^ 
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Child's 
Reading 
Esgpectation, 
Time 2 



Table 3.17 
Child's Reading Escpectation, Time 3 





JL 


Z 


3 


Total 


Percent 


1 


30 


17 


1 


kS 


61.5 


2 


7 


17 


3 


27 


34.6 


3 


1 


1 




2 


2.6 


k 


1 






1 


1.3 


Total 


39 


35 


k 


78 




Percent 


50.0 




5.1 




100.0 



Table 3.16 



Child's Aritlmetic Expectation, Time 3 
12 3 4 Total Percent 





1 


27 


11 




2 


ko 


51.3 


Child's 


2 


13 


15 


1 




29 


37.2 


Arithmetic 












Expectation, 


3 


1 


2 


1 


1 


5 


6.4 


Time 2 
















k 




1 




3 


k 


5.1 




Total 


kl 


29 


2 


6 


78 






Percent 


52.6 


37.2 


2.6 


7.7 




100.0 
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Table 3*19 



Child's Conduct Expectation, Time 3 







1 


2 

mm 












1 


26 


19 


k 






62.8 


Chad's 


2 


6 


10 


3 


2 


21 


26.9 


Conduct 














Expectation, 


3 




2 


If 


1 


7 


9.0 


Time 2 
















If 


1 








1 


•1.3 




Toted 


33 


31 


11 


3 


78 






Percent 


if2.3 


39.7 


llf.l 


3.8 




100.0 



ERIC 



.* ... y 



3-22 



Hov Do Children's Esgpeotations Change Over the Second Grade Year? 



Reading; 

There is some mild increase in children's hopes about reading over 
the second grade year as seen In Table 3-20— whereas lof, look for C's and 
D's In the middle of the year, that percentage has shrunk to about fiSfc by 
th^ end of the year. The shifts upward (ZQffi do not differ significantly 
(X, =0.82, K.S.) from those downward (27%). 

There is significant consistency in reading expectations over the 
second grade year, with 53?t matching between midyear and end-of-year 
second grade (2 » 1.95, P = .05). The consistency over second grade 
however is less , although not significantly so, than that noted over the 
first grade year (60^) , or between the end of first grade and the middle 
of second (60%). 

Perhaps the most interesting thing is the change between the earliest 
expectations and later ones in terms of extreme pessimism. At first 
sampling 8^ of the children expected the lowest grade. As time goes on, 
practically no one ever again has such low hopes (the values ranging from 
1 to 2%) . Another rather surprising fact is the fluctuation from one time 
to the next for a few children. One would think that if a child's expec- 
tations did not remain stable, they would probably shift up a notch or 
down a notch. From the middle to the end of first grade 896 of children 
shift two or more steps, and from the middle to the end of second grade 
7% of the children do so. These percentages do not look large but with 
a sample of around 100, they imply that about 2 children in every class- 
room have extremely labile expectations even within a single year. 



Arithmetic 



In arithmetic the consistency in expectations (kSji) over the second 
grade does not significantly exceed chance (399t) (z « 1.1*3, N.S.). This 
contrasts with a strong consistency (595t, P < .01) between the end of 
first grade and the middle of second, and a lesser but still significant 
consistency over the first grade year ih7%, p < .05) . 

There are shifts upward and downward in about equal numbers (2d£ and 
27% respectively). 



Conduct 

In conduct the consistency over the second grade year {k&ji) does not 
significantly exceed chance (39?t) although it is not far from the 5^ i.evel 
(z = 1.87). There is no significant asymmetry up or down in Table 3.25. 
This is the same state of affairs as that prevailing over the first grade 
year. Surprisingly, in view of this lack of consistency within both years, 
there is some consistency between the end of first grade and the middle of 
second (51%, z « 2.U5, p < .05), This was acconpanied by a significant 



Table 3.20 



Reading 
Expectation, 
Time 3 



2 

3 



Reading Expectation, Time k 
1 2 ^ k Total Percent 



22 
18 
3 



16 
26 

2 



1 
1 
2 



Total kk k3 



39 
U6 

7 
2 

9»* 



In. 5 
48.9 

2.1 



Percent U6.8 1*7.9 4.3 1.1 



100.0 



Table 3.21 

Arithmetic Expectation, Time k 

12 3 4 Total Percent 

1 23 19 3 1 46 48.9 

2 14 18 2 34 36.2 
Arithmetic ^ ^ 

Expectation, 

Time 3 3132 6 6.4 

4 1 6 1 8 8.5 

Total 39 46 8 1 94 

Percent 41.5 48.9 8.5 1.1 100.0 
ER|c ,7'; 3-2^ 



Table 3.22 

Conduct Esgpectatlon, Time k 

123 Total Percent 

1 20 16 3 39 hl,3 

conduct 2 11 2U 3 38 Uo,k 
Expectation, 

Time 3 3 8 1 13 13,8 

^211 k k,3 

Total 37 k9 8 9U 

Percent 39. ^ 52.1 8.5 100.0 
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d|innirard adjustment in esgpectatlons (12^ ioov«d up and 37$ moved down, 
Xj ~ 10.^2, p < .01). Most of the down shift was accounted for by a 
reduction from expecting A's to B's. 

Ih summary , if we Intexpret consistency or above ohai?ce levels of 
matching between expectation over a time interval as being indicative of 
some causal connection between the expectations at the two times (either 
as the earlier being a direct cause of the later, or both being the 
result of a stable third factor such as ability or parental expectations) 
we see the data on levels of significance for both grades 1 and 2 provide 
no simple answer regarding such causal relations. The pattern of the 
significance of matching of expectations from one time to the next is as 
follows: 



Tl-2 T2-3 T3-U 

Reading .01 .0^ .0$ 

Arithmetic ,05 .01 N.S. 

Conduct N.S. ,05 N.S. 




All the non- significant cases display deviations in the direction of above- 
chance matching and almost reaching significance in the case of T3-4 
conduct, (ihe chance es^ectations are exceeded by the following percentage 
points for reading, arithmetic and conduct respectively: Tl-2, l5, 9, 3; 
T2-3, Ik, 16, 12; T3-^» 10, 7» 9«) The consistency of the above chance 
matching, nine cases out of nine being in one direction <p < .6i on a 
binomial test), suggests some causal relation does exist . 

It should be noted that children with low es^ctations do not provide 
an overly abundant iqput of matched expectation cases. That is, those with 
low expectations do not show exceedingly strong tendencies for holding the 
same lew e^g^ectation over time. In general the majority of those esq^cting 
C's and D's at any initial time, hold expectations for average or top 
marks by the second time. (T2-3 arithmetic and conduct expectations are 
exceptions where 56^ and 63^ respectively maintain low expectations over 
time.) 

Those children whose expectations did not match at the consecutive 
times , in general , show no tendency to either lower or raise their expec- 
tations . Conduct expectations between end of grade one and middle of grade 
two are an exception. Here the asynmetry was toward a decline in expecta- 
tion. 



Hew Expectations Change fiy Expectation Level 

(Omitted) 



See Chapter 5 for first year results. 
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Hew Marks Conpare in Different Areas 



Results from the coiqparison of marks In different areas are most 
meaningful when looked at over the whole tljue span available (two 3rears) 
and over all possible combinations of mark pairs. Two questions arise: 
(1) How consistent are children's marks in any two areas (a matching 
problem)? (2) For the cases that do not match, do children's marks in 
one area generally exceed marks in the other area (an asymmetry problem)? 



Consistency or Matching 

The significance of the matching between marks in the different areas 
is sunaarized in the following table. 



Tl T2 T3 Tk 

(N = 85) (N = 86) (N = 102) (N = 103) 

Reading-Arithmetic .01 .01 .01 .0^ 

Arithmetic-Conduct .0^ N.S. N.S. N.S. 

Reading-Conduct N.S. N.S. .01 .01 

(z « l.eh) 



The picture is quite clear. The academic areas (reading and arith- 
metic) show a consistent above chance level of matching. Marks in conduct, 
the behavioral area, generally show no significant matching with arithmetic 
mark (Tl being an exception where a weak and barely significant (z = I.98) 
relation occurs). Conduct marks, on the other hand, show a trend toward 
becoming consistent with reading marks. 

The level of chance matching; between marks in any two areas is 
generally near 39^ while the significantly abora chance matches range from 
k9f) to 6k%, Considerable variation between children's marks in any two 
areas remains even when above chance levels of matching are present. 

When above chance matching occurs it seems to occur at all mark levels. 
There is no apparent tendency for those with high, average or low marks 
to differentially £u:count for excess matches. Where excess matching 
exists children with high, average and low marks all seem to have contri- 
buted to the excess. 



Asymmetry 

The following table presents the pattern of significance associated 
with asymmetry among unmatched marks. In this table R > A is an 
abreviated notation indicating the asymmetry is of the form where reading 
markn are higher than arltlimctlc marks. 
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Reading-Arithme^ ic 
Arithmetic-Conduct 
Reading-Conduct 



Tl 
N.S. 
N.S. 
N.S. 



T2 T3 T^f 

N.S. .05(R>A) .01(R>A) 

.05(A > C) .01(0 > A) .01(0 > A) 



N.S. 



f.S. 



N.S. 



The four significant values in the upper right comer of the table 
arise because of the relatively hard marking of arithmetic at both times 
during the second year. This means that in coaparisons of reading and 
conduct with arithmetic the unmatched cases tend to show the arithmetic 
mark as being the lower. The remaining significant value is a product of 
fairly easy arithmetic marking and fairly hard conduct marking at the end 
of the second year. Neither of these by themselves, however, are severe 
enough to produce a significant relationship with reading which is 
moderately marked. 



How Consistent are the Children's Mirks over Time? 



For the middle-clase school there are three time periods for which 
mark changes are considered, namely, changes during the first year, between 
the end of grade one and the middle of grade two, and during the second 
year. The significance of the matching between marks over the time inter- 
vals is as follows. 



Tl-2 T2-3 T3-U 

(N = 78) (N = 82) (N « 103) 
Reading .01 N.S. .01 
Arithmetic .01 N.S. .01 
Conduct .01 N.S. .01 



Tlie pattern is both obvious and somewhat startling. In all three 
areas children's marks within both the first and the second grades show 
significant above chance levels of consistency or matching. The chance 
levels of matching range from 36^ to h9^> while the observed significant 
matches ranged from 5% to 73%. (The observed Insignificant values ranged 
from kl% to U5% matches.) Clearly, within each grade the marks the 
children receive are quite consistent. About two- thirds of the children 
receive identical marks at the two observations in each year. 

The matching of marks between the end of grade one and the middle of 
grade two shows a completely different picture. Here, none of the areas 
shows a significantly above chance consistency. 

Substantively there are two styles of explanation for this finding. 
First, because in this school each child's performance is marked 
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according to his own ability (their narks approxiaate the children's 
efforts) the finding can be restated as follows: Any given teacher 
Is quite consistent In his/her evaluation of a child's scholastic 
efforts but teachers differ with each other In their evaluation of 
effort. The second estplanatlon Is that of a year X child Interaction, 
Each school year can be envisaged as a set of environmental variables 
for each child that are largely consistent throucthout a year yet which 
change between years (e.g., teacher personalities, presence of a child's 
best friend In his class, etc.). If children's efforts Interact with 
such variables (different environments stimulating or depressing different 
children) the observed pattern of mark consistency would be expected. 
The above alternatives reflect one's choice of a model that postulates 
variation In teacher's marking (with children expending stable efforts) 
or variation in children's efforts (with reliable Inter-teaoher marking). 
The real world probably evidences a mlnlbure of these "ideal types". 

For the cases that do not display matching of marks at the 
coiqparison times, it can be asked if either high or low marks tend to 
predominate. The following table summarizes the significance of , 
asymmetry in the comparison of marks at the various times as tested by Xi • 
(Use of 2 > 1 etc. in this table indicates marks at time 2 are higher 
than marks at time 1, etc.) 



Tl-2 T2-3 T3-4 

Reading N.S. N.S. .01(U>3) 

Arithmetic .01(2 >1) .01(2 > 3) .01(4 > 3) 

Conduct N.S. N.S. N.S. 



Conduct marks remain constant (on the average) throu|(hout the first 
two years. Arithmetic marks increase over the first year, indeed they 
are very high at year end— Ul^ get A's, the highest mark possible. The 
drop in arithmetic marks from the end of grade one to mid-grade two is 
more than "getting back to a reasonable mark distribution" for the 
arithmetic marks at the later time have the lowest average mark ever 
assigned in any of the areas (2.07)* When arithmetic marks again rise 
over the second year they do not rise above the level attained the first 
year. The rise in conduct marks over the second year to the bluest 
average observed (l.6o) places the next year's teacher in a dlfficiat 
"personnel manftgement" position. This teacher Is forced to nark coqpari- 
tively hard to produce a reasonable mark distribution and extremely hard 
comparatively if he/she wants the children's marks to be able to "show 
come improvement" over the third grade year. A child's narks cannot 
improve if they are already at the top— even if they got to the top by the 
last year's teacher's lenient year-end marking. 

In all areas teachers generally assign few year end marks that differ 
by more than one point from the midyear mark assigned. The percentages 
are 1%, 1^ and 3% for reading, arithmetic and conduct respectively in 
grade one and 2^, 1% and 1^ respectively in grade two. Between years one 
and two (times 2 emd 3) there are a fev more changes of two or more points 
in marks (5%, k% and hi respectively) but these are still quite small values. 
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How Expectations Compare in Different Areas 



At End of First Grade 

There Is a surprising negative association between expectations for 
marks In reading and marks in arithmetic at the end of first grade. Only 
32^ gave the same expectations for the two areas, a failure of matching 
that is highly significant (z = -2.6l, p < .01). (An average of 
matching would be es^ected by chance.) This is contrary to the matching 
in same-subject expectations over time. For Instance there is significant 
matching for first graders between first and second semester expectations 
in reading (60^, p < .01), and in arithmetic {kT%, p < .05), although not 
in conduct. 

There is a degree of matching that would be expected by chance between 
reading and conduct expectations, and between arithmetic and conduct 
esgpec tat ions at the end of first grade. None of the cooffarlsons of 
es^ectations between areas show expectations in one area to significantly 
exceed es^ectation in another. 



At End of Second Grade 

There is no significant matching between children's expectations for 
reading and for arithmetic at the end of second grade {kli match). (This 
lack of association is to be contrasted with the significant negative 
association at the end of first grade.) There, furthemore, is no 
significant tendency for expectations in one area to be higher than the 
other. 

There is also only the degree of matching that would be predicted by 
chance between es^ectations in arithmetic and conduct and between those 
in reading and conduct. For neither of these comparisons is there a 
tendency for children whose expectations do not match to have expectations 
higher in one area. 

It is of interest to note that while a large majority of es^ctations 
in different areas fall within one unit of each other, there are a reason- 
able number of cases where this is not true. For reading and arithmetic, 
arithmetic and conduct, and reading and conduct the respective percentages 
of expectations differing by two or more points are 6ff,, 10^6 and 11^ at the 
end of first grade, and 3%, 7%, and 5^ at the end of second grade. Very 
few children (an average of about 1.5^) hold low eaqpectations (i.e., for 
C's or D's) in any two areas. None hold low e3Q>ectations in all three 
areas at the end of grade two. This suggests that in the middle-class 
school at least, if ameliorative intervention via raising expectations 
was to be engaged in it should be subject and child intervention as 
opposed to simply child oriented intervention. One would not attesq;>t to 
"raise the child's expectations" but rather "raise his/her reading or 
arithmetic or conduct es^ectation", as the other expectations are usually 
already at average or above average levels. 
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What is Inqpact of First Mark on Expectation Later? 

Reading 

There is a surprising lack of consistency between what the child 
expects at the end of grade 1 with his mark earlier in the year. Less 
than ^ of the children esepect again what they got earlier, and the amount 
of matching does not exceed what would be expected by chance. For the 
cases that do not match (51^t) almost all deviate in an optimistic direction. 
. Thus almost half of those es^ecting a 1 got a 2 last time. About 10^ of 
the students look for a decrease, all but one are in the group who expect 
a 2 this time but who received a 1 l^st time. Fully ki^ look xor a 
higher grade. (This asymmetry is highly significant; » 16.10, p < .01). 
If the teachers were to conform to students' hopes they would have to 
award 63^ of the students A's.' (They actually had given over one- third 
A's earlier and do give k7% A's at year end.) 



Arithmetic 

There is even less correspondence between the child's hopes at the 
end of the year for arithmetic and his earlier mark than was true for 
reading. The correspondence between the first mark in arithmetic and 
hopes at the end of first grade does not exceed £ui amount predicted by 
chcuice. Also, the asymmetry is likewise in an optimistic direction, 
although here not quite statistically significant (xf * 3.**5). An 
interesting fact about this relationship is that some children {%) show 
a correspondence between low marks and hopes for high marks (I's and 2's). 
Conversely eleven children (l3?t) got 2's earlier but expect to get 3's 
and 4's at the end of the year. The teachers actually had not given any- 
one a "U" in either arithmetic or reading and children do not look for 
U's in reading or conduct. Arithmetic seems to be a subject which provokes 
more anxiety (but this could be a manifestation of a small case base). 
Some children's fears and other's high hopes are not grounded in reality. 

Conduct 

There is a significant amount of matching (z = U.l6, p < .01)— 36^- 
between what the child expects for his mark in conduct and the mark he 
received first in this subject. There is also a significant (Xj = 13. 
p < .01) tendency toward optimism in those whose escpectations depart from 
the earlier mark, for of those expecting their mark to change, expect 
it will rise. 



What is the Impact of Later Marks on Expectations? 



Tlie degree of matching between the mark in reading at the end of first 
tirade and the mark expected in t.he middle of second fj;rade is again within 
cliunce expectancy. About k'rt> expect to receive what they received earlier. 
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Table 3.23 

Child's Reading Expectation, Time 2 
1 2 3 Total Percent 





1 


22 


7 




29 


35.4 


Reading 


2 


25 


17 


1 


if3 


52.1^ 


Mark, 










Tine 1 


3 


5 


If 


1 


10 


12.2 




Total 


52 


28 


2 


82 






Percent 


63A 


3^*.l 


2.U 




100.0 



Table 3.2U 
Child's Arithmetic Expectation, Time 2 







1 


2 


3 


k 


Total 


Percent 




1 


11 


7 






18 


22.0 


Arithmetic 


2 


27 


22 


7 


k 


60 


73.2 


Mark, 














Time 1 


3 


1 


3 






If 


h.9 




Total 


39 


32 


7 


k 


82 






Percent 4?. 6 


39.0 


8.5 


k.9 




100.0 
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Table 3.2^ 

Child's Conduct Expectation, Time 2 
123 Total Percent 





1 


30 


2 


2 


3k 


U1.5 


Conduct 


2 


18 


13 


3 


3k 


41.5 


Nark, 














Time 1 


3 


k 


6 


3 


13 


15.9 




k 




1 




1 


1.2 




Total 


52 


22 


8 


82 






Percent 


63.U 


26.8 


9.8 




100.0 
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and the departures in optimistic or pessimistic directions are not very 
different (30^fc and 23^6 respectively) . (After the first mark there was 
a similar lack of consistency between eacpectations at the end of the year 
and midyear marks, but hopes were notably in kn optimistic direction.) 

The correspondence between marks in reading in the middle of the second 
jrear and expectations at the end of that year still does not exceed what 
would be predicted by chance, given the marginal distributions. But now 
there is a resurgence of optimism. Significantly more (37^) have eacpecta- 
tions higher rather than lower {lk%) than their marks (x, « 9.68, p < .01). 
It thus appears that the early optimism, which appeared to subside, 
surfaces again in second grade. 



Arithmetic 

In arithmetic there is likewise no significant matching between the 
mark at the end of the first grade euid the expectation held in the midyear 
following. There is some tendency toward optimism, but it is not signi- 
ficant. This replicates the picture seen for the relation between first 
grade midyear marks in arithmetic and expectations later that year. 

Again there is failure to match significantly from midyear to the 
enj of second grade in arithmetic. There is, however, a highly significant 
(Xi = 18.96, p < .01) tendency for expectations to be higher than the mark 
received earlier, for kdffi, expect a higher mark whereas 11^ eaipect a lower 
mark. Although only 1% of children earlier had received a mark of "A", 
41% of the children look for that mark at the end of the year. 



Conduct 

In conduct there is not a significant degree of matching between the 
mark received at the end of first grade and the expectation at the middle 
of second grade, although there was significant matching of this sort from 
the middle to the end of first grade. Also the scatter above and below 
is now similar whereas earlier expectations were significantly optimistic. 

Between the middle and the end of second grade, significant matching 
again appears (585^) (z = 3.75, P < .01). Also optimism does exceed 
pessimism to some extent but not significantly. 

It is tempting to speculate that "conduct" is a teacher-specific 
activity and one which students feel they can effectively alter. Students 
may have to learn what a particular teacher regards as acceptable and 
unacceptable conduct, and they may expect little carry-over between first 
and second grade. The students' expectation of little carry-over between 
years Is substantiated by the insigi^ificant matching for conduct between 
times 2 and 3 but in this area students' expectations come to match the 
mark given at midyear (and this happens both years) which suggests 
students feel they cannot effectively alter their conduct. They adopt 
their previous mark as their present expectation. The data contradict 
the above "tempting speculation". 
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To sum UP ! The pattern of the results for all the cooQ>arisons of 
e3Q>ectations at times subsequent to mark assignments are presented in the 
following table. (M = matching^ A = asymmetry and again E>M means for 
the asymmetry expectations are higher than earlier mark. The marks are 
always observed at the earlier time period—expectations at the later time 
period* ) 





Times 1 


and 2 


Times 2 and 3 


Times 3 and k 




M 


A 


N A 


M 


A 




(N = 82) 




(N = 77) 


(N - 99) 




Reading 


N.S. 


.01(E>M) 


N.S. N.S. 


N.S. 


.01(E>M) 


Arithmetic 


N.S. 


N.S. 


N.S. N.S. 


N.S. 


.01(E>M) 


Conduct 


.01 


.0.1 (E>M) 


N.S. N.S. 


N.S. 


N.S. 



In reading and arithmetic children show no sign of adopting an earlier 
mark and maintaining that specific mark as their later expectation . 
Surprisingly, in conduct this does occur within grades one and two (but 
not between grades) . This is directly opposite to what would be expected 
if children felt they could effectively alter their conduct marks during 
grades one and twoi 

Those dedicated to an "adopting of marks as later e^gpectations" theory 
might suggest that the issuing of a report card between each of our 
"earlier mark" "later expectation" pairs renders this an insufficient test 
of the theory, the claim being that children may be showing an extreme form 
of "adopting marks" at each successive mark assignment. This suggests that 
only the previous mark assignment can be used to test the theory, not a 
mark assignment one removed from the time of the measurement of expectations. 
This claim has little foundation, however. It has already been reported 
in the dlRcussion of marV. consistency over time that marks show highly 
significant and substantial consistency within each grade. Therefore a 
child's mark that appears between the time of the initial mark determina- 
tion and the time of the expectation determination is very likely to be 
identical with the mark utilized in the comparison, thus removing the 
objection. (The case between years is not as clear— here no significant 
matching of marks occurred (l/3 to 1/2 of the marks still matched by 
chance) but this was not a matching between end of grade one and the first 
marks of grade two, as would be necessary to Justifiably make this 
argument.) 

The asymmetry that occurs is only present within grades one and two 
(not between the grades) and is always of the type where present expecta- 
tions exceed earlier mark. This is not surprising since expectations 
generally exceeded marks at all times. In particular the arithmetic 
asymmetry for times 3 and h is partially an artifact of the extremely 
hard arithmetic marking at the middle of grade two (time 3). This 
asymmetry would be expected if the children simply retained their previous 
expectations. There seems no basis on which to predict when expectations 
will exceed previous marks, other than that such an excess appears within 



grades. While asymmetxy in this type of table is a sign of unreallsm 
if e:Q>ectations drastically exceed previous marks, it can also be a sign 
of realistic striving if expectations moderately exceed previous marks. 
(The latter is probably the case for QTf> of those whose expectations 
exceed previous marks in the tables with significeuit asymmetry, since 
their expectations exceed their mark by only one point.) 



Effects of Feedback on M&rks and Eb^ectations 



Discussions of e:q?ectation-mark agreement in terms of marks received 
(midyear and end of year) and in terms of expectation level are found 
in Chapter 5 as is a discussion of feedback effects for both middle-class 
cohorts in first grade. In this section we will review briefly the 
findings on feedback for Cohort 1 as a prelude to discussion of feedback 
effects in second grade for that cohort. 

"Feedback" in this discussion will be called "positive," "neutral," 
or "negative" depending on whether the first mark received was higher 
than expected (positive^, the same, or less than expected (negative). 

In reading there was a strong tendency for first grade children to 
persist in the same expectation at year's end if midyear marks matched 
expectations or if marks were less than esg^ected. if marks exceeded 
expectations most {79i>) revised hopes upward, in arithmetic and conduct, 
there was a similar (but insignificant^ tendency except that children 
were less apt to persist in expectations if feedback was negative. For 
marks in buLh reading and arithmetic most of those whose marks rise were 
children who expected to do better than they actually did at midyear. 



Second Grade Marks 
Reading 

Table 3.26 presents the changes in marks that occurred between the 
middle of grade two (time 3) and the end of grade two (time k) following 
the various possible combinations of marks and expectations that occurred 
at the middle of grade two. The minor diagonal of this table is 
significantly (z = 3.38, p < .01) over-represented which indicates an 
above chance occurrence of cases where: marks move up if the child had 
earlier expected to do better, marks move down if the child had earlier 
expected to do worse, and marks remain the same if the child earlier got 
exactly what he es^ected. Reading marks respond more vigorously over the 
second grade to a positive discrepance (expectation exceeds mark^ than 
to a negative discrepance. Sixty per cent of the cases that showed an 
initial positive discrepance had marks that moved up in response, while 
only 8^ of the cases that showed an initial negative discrepance had 
marks that declined in response. (This is still above the chance level 
of 3% expected declines.^ 



Table 3*26 



Child Did B« S, W 
than He Expected to 
Do in Reading, 
Time 3 





Reading Mark Went 


U. S. D 


from 




up 


Time 3 to Time 4 
Same Down 




Better 


2 


22 


2 


26 


Sane 


8 


26 


1 


35 


Worse 


21 


Ik 


0 


35 




31 


62 


3 


96 



Table 3.2? 



Child Did B. S, W 
than He Expected to 
Do in Arithmetic, 
Time 3 





Up 


Time 3 to Time k 
. Same Down 




Better 


3 


15 


1 


19 


Same 


k 


29 


2 


35 


Worse 


13 


26 


1+ 


h3 




20 


70 


7 


97 
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Table 3*28 



Child Did B. S. W 
than He Esgpected to 
Do in Conduct > 
Time 3 





Conduct Mark Went 


U. S. D 


from 






Tloe 3 to Time 4 
Sane Doim 




Better 


3 


19 


k 


26 


Sasie 


7 


32 


7 


k6 


Worse 


11 


12 


2 


25 




21 


63 


13 


97 
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Table 3*27 Indicates that arithmetic shove a similar result to that 
In reading with some minor differences. Ttie minor diagonal matching 
significantly exceeds cheuice expectation (z a 1.80, p < .03) using a one- 
tailed test but the over-representatlon characterizes only two of the 
three diagonal cells these show a) a mark rise following an Inltlcd 
positive expectation discrepance or b) no change In marks following no 
Initial discrepance. Too few oases Involve the falling of marks for any 
conclusions to be drawn. The movement of marks upward following a prior 
condition in which e^cpeotations exceeded marks, is not as strong in 
arithmetic as in reading (30^ move up here while 60% move up in reading). 



Conduct 

A similar pattern prevails for conduct marks (Table 3*28). The minor 
diagonal is significantly over- represented (z » 1.88, p < .05) using a 
one- tailed test. The cells indicating downward mark movement following 
a negative discrepance and no mark change following no discrepance are 
only slightly over-represented (X^ observed vs. 13^ expected; and 70^ 
observed vs. 63% expected respectively). 

To sum up , in all three areas mark changes over the second year 
tend to follow expectations with the direction of movement predicted by 
the expectation aspect of the initial discrepance between marks and 
expectations . Marks move up more easily than down in response to an 
original discrepance , thus continuing the "mark" half of the bouyancy 
effect (both marks and expectations tending to move up to close an 
initial discrepance more easily than they move down to close an initial 
discrepance) noted in grade one. 



Second Grade Expectations 

Tables 3*29, 3.30, and 3*31 show the changes in expectations that 
occur durlzig the second grade following the various possible types of 
aark-e:qpectati6n discrepances, for reading, arithmetic and conduct 
respectively. The similarity of these tables allows their simultaneous 
discussion. In this section, values presented in triplicate are the 
respective values for reading, arithemtic, and conduct. 

Significant above chance major diagonal matching occurs {z ~ 3.70, 
p < .01; z - U.82, p < .01; z s 6.33, P < .01). This indicates that 
during the second grade children's expectations tend to; move up if 
they eeurlier had received a better mark than they expected, move down if 
they earlier had received a mark worse than they expected, and stay the 
same if they earlier received exactly what they expected . The rise in 
expectations following the receipt of a better mark than esqtected (69^, 
78^, and 73^) is more pronounced than the fall in expectations following 
the receipt of a worse mark than esipected (U7^, U9^, cmd 60^). Conduct 
is somewhat different than the substantive areas in that negative feed- 
back produces downward expectation revision with nearly the same certainty 
as positive feedback produces upward revision of expectations. 



Table 3.29 



Child Did B. 8. W 



than He Eagpeeted to 
Do in Reading^ 
Time 3 



Reading Eaq^eetation Went U. 8. D 
from Time 3 to Tine 4 





UP 


Sane 


Down 




Better 


18 


8 


0 


26 


Sane 


7 


22 


3 


32 


Worse 


0 


18 


16 


3U 




25 


U8 


19 


92 



Table 3.30 



Child Did B, 8, W 



than He Expected to 
Do in Arithmetic, 
Time 3 



Arithmetic Expectation Went 
U« S, D from Time 3 to Tine k 





Up 


Same 


Down 




Better 


Ik 


k 


0 


18 


Same 


11 


19 


k 


3h 


Worse 


1 


20 


20 


ki 




26 


k3 


2k 


93 
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Table 3*31 

Conduct Expectation Went U. S. D 
from Time 3 to Time k 

Up Same Down 



Child Did B. S. W 
than He Expected to 
Do in Conduct, 
Time 3 



Better 19 
Same 7 
Worse 1 
27 



7 


0 


26 


28 


7 


k2 


9 


15 


25 


kk 


22 


93 
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Esgpectstions thus also continue to contribute to a bouyanoy effect 
as noted for grade one. Escpec tat ions tend to rise to close a discrepance 
more readily than they fall to close a discrepance. (Note marks exceed 
expectations by almost exactly the same amount as expectations exceed 
marks when discrepances appear. OSiis eliminates differing degrees of 
discrepance as an explanation.) Hhe bouyanoy effect (marks and esqpeota- 
tlons moving up to close an initial dlscrepetnce more readily than they 
move down to close an initial discrepance), then, continues on into grade 
two . 

nils section on expectation changes and the preceeding one on mark 
changes provide evidence for the causal efficacy of marks on espectations 
and the causal efficacy of espectations on marks in the second grade, 
thus extending this observation beyond the first grade as reported in 
other sections of this report. 

It should be noted that both marks and esgpectatlons are reasonably 
stable during grade two. Marks remain the same at both times 3 and k 
in 6%, 72^, and 63^ of the cases for reading, arithmetic and conduct 
respectively. Expectations are a little more volatile, yet they remain 
the same in 32^, U6^, and kT% of the cases for the respective areas. 



Feedback Effects Between Grades One and Two 

Having shown that expectations and marks both display causal efficacy 
vithln grades one and two, it can be asked whether the same holds true 
between grades one and two. The conparlson involves effects of oark- 
expectatlon discrepances at the end of grade one (time 2) on marks and 
expectations at the middle of grade two (time 3). 

For the children there have been substantial changes during this 
periods Including suomer vacation when they may have forgotten about 
their academic ambitions or fears. They have entered a new classroom, 
with some new student peers, a new teacher and more advanced subject 
matter. Also they have received one further report card during this 
period. There are thus many factors that could intervene and disrupt the 
causal effects observed within grades. 

Tables 3.32, 3.33 and 3»3h show changes in marks, while Tables 3.35, 
3.36, and 3*37 show changes in expectations, following year end mark- 
expectation discrepances, for reading, arithmetic and conduct respectively. 

The following table presents the results for the tests of whether 
the diagonals (main diagonal in the case cf expectation; minor diagonal 
in the case of marks) contain an over- representation of cases. Since 
for our purposes a change in a variable following a previous discrepance 
is evidence for causal efficacy these are really tests to see if the 
causal relations hold between years. (Column headings are dependent 
variables.) 
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Table 3*32 



Reading Mark Went U. S. D from 
Tixne 2 to Time 3 



Child Did B. S. W 
than He Ejected to 
Do in Reading, 
Time 2 





up 


Same 


Down 




Better 


0 


6 


6 


12 


Same 


k 


18 


13 


35 


Worse 


Ik 


10 


8 


32 




16 


3h 


27 


79 



Table 3.33 



Arithmetic Mark Went U. S. D from 
Time 2 to Time 3 



Child Did B. W 
than He E:gpeeted to 
Do in Arithmetic, 
Tiine 2 





up 


Same 


Down 




Better 


2 


6 


13 


21 


Same 


h 


15 


17 


36 


Worse 


k 


12 


7 


23 




10 


33 


37 


80 
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Table 3*3^ 



Child Did B. S. W 
theui He £3g?ected to 
Do in Conduct » 
Time 2 





Conduct Mark Went 


U. S. D 


from 




UP 


Tine 2 to Tine 3 
Same Down 




Better 


2 


3 


7 


12 


Same 


3 


20 


12 


35 


Worse 


20 


12 


1 


33 




25 


35 


20 


80 
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Table 3.33 



Child Did B. S. W 



than He Expected to 
Do in Reading, 
Tiffle 2 



Heading Expectation Went U. S. D 
from Tioe 2 to Time 3 







Same 


Doim 




Better 


k 


7 


0 


11 


Same 


2 


21 


11 


3k 


Worse 


3 


18 


10 


31 




9 


ke 


21 


76 



Table 3.36 



Child Did B. S. W 
than He Esepected to 
Do in Arithmetic, 
Time 2 



Arithmetic Expectation Went 
U. S, D from Time 2 to Time 3 





up 


Same 


Doira 




Better 


10 


8 


2 


20 


Same 


7 


2k 


3 


3k 


Worse 


0 


13 


9 


22 




17 


k3 


Ik 


76 
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Table 3.37 



Conduct Expectation Went U, S. D 
from Time 2 to Time 3 



Child Did B. S. W 
than He Expected to 
Do in Conduct, 
Tine 2 







Same 


Down 




Better 


5 


5 


1 


11 


Same 


2 


20 


11 


33 


Worse 


2 


Ik 


16 


32 




9 


39 


28 


76 
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Significance of Diagonal (Main or Minor) Matching 



Changes in Marks 



Changes in Expectations 



Beading 
Arithmetic 



Conduct 



.01 (z = 2.78) 
N.S. (z = 1.01) 
.01 (z = 4.33) 



N.S. (z 
.01 (z 
.01 (z 



1.05) 
3.62) 
2.58) 



Clearly some causal efficacy for both marks on esgsectations and 
expectations on marks is present. 

Reading and conduct marks respond to year end discrepances as do 
arithmetic and conduct expectations. 

Vfhile arithmetic marks respond somewhat to e^cpectations they do 
not respond significantly. This might be a function of changed subject 
matter, but probably reflects the ambiguity of the evaluations of 
performance in the two sureas. Arithmetic is more easily Judged "right" 
and 'Vrong" and a child's performance is less affected by sheer effort. 
Reading performance depends in part on Intonation, "expression", and 
interpersonal variables that are harder to Judge reliably. A teacher's 
Judgments may be more easily shaded in reading or conduct and a child 
evidencing high self- expectations in these tw^ areas may exert consid- 
erable "pull" on a teacher for a better mark. Ttie changes In e^gpecta- 
tions may be the fall out which occurs subsequent to a considerable 
rise in average marks in arithmetic over the first grade year. 

It is worth noting that all diagonal cells in Tables 3*32 to 3.37 
are over-represented, even when over-representation does not attain 
significance. This foreshadows the observation that the bouyancy effect 
(marks and escpectations moving up to close an initial discrepance more 
readily than they move down to close an initial discrepance) does not 
operate between years. For changes in marks, downward movement following 
a negative discrepance (marks exceed expectations) is, if anything, more 
likely than an upward mark movement following a positive discrepance. 
For reading, arithmetic and conduct respectively the percentages of 
marks moving down (after a negative discrepance) are 50^, 62%, and 3&^t 
and the percentages moving up (after a positive discrepance) are UU^, 
17^, and 61%, The expectation aspect of the bouyancy effect has also 
largely disappeared between grades one and two. Expectations are about 
as likely to move down following a positive discrepance (expectation 
exceeds mark) as they are to move up following a negative discrepance 
(mark exceeds expectation). For reading, arithmetic and conduct respec- 
tively the percentages of expectations moving down following a positive 
discrepance (expectation exceeds mark) are 32%, kl% and 50%, and the 
percentages moving up after a negative discrepance (mark exceeds 
expectation) are 36^, and U3%* 

In cum between grudou unc and two both nuirku and expectations in 
general seem to maintain their causal cfricacy for one another when earlier 
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discrepances are cooaipared with later nark changes or expectation changes, 
^o cases (reading e^ipectations and arithmetic narks as dependent 
variables) show no significant effect but the deviations that do occur 
even in these cases are in the predicted directions. The buoyancy effect 
(especially the mark aspect of this effect) does not seem to operate 
between grades one and two. 



Parents' Expectations 



Conapared to Children's Expectations 
Reading 

What is the degree of congruence between the expectations of pcurents 
and children in reading? In only 23 out of 8o cases (31^) for reading 
is there perfect agreement. A random shuffle would, on the average, 
produce 31^ agreement so the amount of agreement is not statistically 
significant. 

In k2 of 33 cases of disagreement (76^) the parent's expectation 
is lower . All of these instances represent a child expecting an A and 
the parent expecting a B or C. There are significantly fewer Instances 
iXi = 1^.29. P < .01) where children's expectations are lower than 
parents', 2h^ of the inconsistencies are of this type. 

Another way to summarize the lack of congruence between the two sets 
of e3Q>ectations (in addition to different meems, stwdard deviations, 
margins, and only 31^ agreement) is to compute a product- moment correlation 
as an index of the degree to which peu?ents' and children's expectations 
covary. nils correlation is practically zero (0.02). Ihe margins of 
T&ble 3*38 show that approximately 90^ of both parents and childz^n e^^ect 
an A cv a B but that only about one-third of these expectations are 
consistent. 



Arithmetic 

The picture for arithmetic is in general similar to that for reading 
but there is a somewhat different kind of disagreement between parents 
and children. Again the actual amount of agreement is exactly what would 
be predicted by chance (30^^). Again, also, many children (60?t) expect 
the highest mark. For arithmetic, however, peirents are more optimistic 
than they were for reading~2^ of them (compared to l6^ for reading) 
expect the highest grade. 

The other striking difference between reading and arithmetic is that 
children cure noticeably more pessimistic, 21^ of them looking for a low 
grade (3 or k). Parents a^-e no more pessimistic for arithmetic than for 
reading (about 11^ expect a 3, none expect a U). The difference between 
parents' mean expectation and children's is 0.17 in curlthmetlc, about 
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Table 3.: 
Parent's Reading 
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2 




1 


6 


35 


Child's 








Reading 
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17 


Expectation, 








Time 1 
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2 
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k 




Total 


13 


58 



Percent l6.2 72.5 



Expectation, Time 1 

3 h Total Percent 

7 60.0 
23 28.7 

1 3 3.7 

1 6 7.5 

8 1 80 

10.0 1.2 100.0 
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Arithmetic 
Expectation, 
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Table 3.39 
Parent's Arithmetic Expectation, Time 1 
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Table 3.^ 
Barent's Conduct Expectation, Time 1 
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half the size of the mean difference for reading (0.33). There is still 
significant asymmetry (see Table 3*29) In terms of children's expectations 
exceeding parents' (x^ - h,37, P < .03) but the asymmetry is less marked. 
Parents are more optimistic about arithmetic than about reading; children 
are slightly less optimistic. 



Conduct 

The expectations for conduct are heavily skewed toward the high end 
for both parents and children. Eight parents look for a relatively poor 
shewing (a "C") in conduct but none esgpects the worst mark. For 6 of 
these 8 worried parents, the child expects an outstanding performance.' 
There are 10 children worried about conduct, but they are different 
children from the eight whom the parents are worried about. Again there 
is no significant matching between expectations (3^^^ observed, 32^ 
expected by chance; with r » .03) and significantly less optimism of 
parents compared to children (Xj - 8.32, p < .01). 

To sum up ; Parents and children do not exhibit above chance levels 
of agreement in terms of expectations. Children consistently are more 
optimistic than parents on the average. Parents' expectations are very 
much alike across the areas of reading, arithmetic, and conduct, but 
children are noticeably less confident in arithmetic than in reading 
or conduct. 



First Report Card Marks Compared to Parents' Expectations 



Reading 

Parents are better forecasters them children of marks children will 
receive in reading. In ^ of 63 instances (60^) there is agreement which 
is significantly above chance (z = 3.37, P < .01) (see Table S.^+l). 
This represents an excess of 1% over what would be expected by chance. 
Hie most interesting discrepances occur for those cases where the parent 
e:qpects a 2— in 60^ of these the child actually gets a 2, but in almost 
30^ (17 cases) the child comes home with a 1. In the other 10^ of these 
cases the child comes home with a 3. Parents as a group are good at 
forecasting the numbers of poor marks given- -about 12^ C's emd no D's. 



Arithmetic 

The first arithmetic mark likewise is much closer to parents' 
expectations than to children's- -again a high percentage (6l^) of parents 
have their guesses exactly confirmed, but the level of significance 
(z - 2.18, p < .0^) and the excess matching over chemce (10^) are both 
less than was true for reading. In this area there is no marked Imbalance 
between what is predicted by parents and how children perform in the 
direction of children outstripping parents' hopes. The same number of 
children (16) do better than parents expect as do worse (see Table 3*^2). 
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Table 3.^1 
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Arithmetic Mark, Time 1 
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Table 



Conduct Mark* Time 1 
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Conduct 



In conduct parents correctly anticipate their child's mark In 3dji 
of the cases while only 39^ would be expected to do so by chance (z s 3.60, 
p < .01). In addition, when parents' guesses disagree there Is a 
significant tendency for them to underestimate their child's performance 
(xf « »*.36, p< .05). 

To sum up : In all three marking ax^as about 60^ of parents can 
forecast their child's mark exactly. In each area this degree of 
accuracy Is slgnlflceuit In statistical tezns but perhaps more Ijqportant, 
the accuracy Is Impressive in absolute terms. In both reading and conduct 
parents ' Inaccuracy Is apt to lie In under -estimating. Parents as a 
group seem aware of the marking distributions actually used by the 
first-grade teachers. 



The Consistency of Parents ' Expectations From First to Second Grade 

In second grade 71% of the same parents who had given their 
expectations In grade one were asked again for escpectatlons (their child 
now being In grade two). About half the parents (28 out of 39) shifted 
their exgpectatlons for reading from grade 1 to grade 2. Parents show a 
slight, but not a slgnlflcemtly greater than chance, likelihood of holding 
the same reading esqpectatlon at the two times (z s 1.63, N.S.). As noted 
earlier, upon first inquiry parents tended to "play it safe" by expecting 
a 2. Of the parents who had earlier expected a 2 (73^)9 now a little 
over half still expected a 2, but about a third looked for a 1 in second 
grade. At second grade a substantlcd number of parents then have raised 
their expectations although the total nuoiber of parents expecting I's or 
2's is approximately the same. All of those who looked for poor marks 
in first grade are looking for A's and B's in second grade, but about 
the same percentage of parents as before (10^) look for C's. 

In etrlthmetic the margined distributions for parents' first-grade 
and second-grade expectcmcies are almost the same, and there is signifi- 
cant consistency (63^) between parents' guesses from one year to the 
next (z s 3.30, p < .01). Ten have shaved their hopes by one grade 
interval, while nine have increased their hopes the same amount. The 
overall patterns cure very similar from one year to the next. 

Conduct looks slmileur to arithmetic including the high consistency 
(67^) from one year to the next (2 = 3.68, p < .01). Twenty-five per 
cent of the parents who previously expected a 2 come to expect a 1 in 
grade two. 

In sum, parents display a tendency to hold the same expectation 
from the first grade to the second (significantly so for arithmetic and 
conduct). No drastic marginal changes occur but slight upwaxxl movements 
in expectations are present for those previously expecting 2's in reading 
and conduct. 
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Table 3M 
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Are Parents' Expectations for Second Grade Affected by Earlier Marka? 



Reading 

Parents' expectations In second grade match to a significant 
degree (z = 3.7^, P < .01) the performances their children have earlier 
displayed. In fact 6l% of the parents' expectations match earlier 
marks, compared to the 38^ consistencies expected by chance. Those 
whose expectations differ from the previous mark are about eqiually 
divided In dlrectlon»l6^ look for Ijqprovement and 23^ look for a 
decline. 



Arithmetic 

The picture for arithmetic Is the same as for reading In that there 
Is considerable concordance between parents' expectations for second 
grade marks and the marks actually received at the end of first grade 
(z * 2.09, P < .05); 5h% of parents believe their child will perform at 
the same level In arithmetic as eeurller {hl% would be expected to do 
so by chance). There Is an Interesting asymmetry though, in that of 
those parents who forecast a different mark {k&f) 11% look for a better 
mark but 3^% look for a poorer mark.' This asymmetry is moderately 
significant {x\ » 6.03» P C .05). Parents thus manifest considerably- 
less confidence in their children's arithmetic performance than In 
reading performance. 



Conduct 

Middle class parents etre realists when it comes to conduct, or at 
least they seem to use the same marginals teachers use in evaluating 
conduct. There is a highly significant amount of matching (z = 4. 56, 
p < .01)— 66^ of the parents expect the same conduct mark in second 
grade as was obtained in first (compared to 39^ matches es^ected by 
chance). Those who expect a different grade split about equally in 
each direction. 

In sum , arithmetic is the only area that displays asymmetry— parents 
tending to expect lower mEkrks than their children received at the end of 
grade one. All three areas display significant matching between the 
parents' second grade expectation and children's markp at the end of grade 
one. This matching may be thought of as either (a) the parents adopting 
of the earlier marks as their later expectation or (b) as a spurious 
relationship with parents' first grade expectation as the Independent 
variable. (Previously it was shown that parents' grade one expectations 
matched their grade two expectations. If their first expectations also 
match the children's marks as the end of the first year (as they do for 
arithmetic and conduct in the ccmbined analysis page 5-U9) the observed 
matching could partially be a function of the fact that both variables 
now being considered matched with the parents' time 1 expectations.) 
The matching between parents' expectations at times 1 and 3 is not that 
Gtron^y however (actually not significant in the case of reading), so 
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this explanation alone is probably insufficient to produce the observed 
matching there— especially in reading. It remains, however, a source 
that possibly contributes substantially to the observed matching between 
parents' second year es^ectations and arithmetic and conduct marks at 
the end of grade one. 

The reader might also note that the causal efficacy of a discrepance 
between year end marks on changes in parents ' expectations has not yet 
been examined. This awaits further analysis. 



Sex Differences 

Sex and Marks 



First Marks 

The major difference in first report cards by sex is that, of 10 
children receiving 3's In reading , 9 are boys. (No U's are given.) 
There is a fairly even division by sex In tems of arithmetic marks. 
In conduct 63% of the girls, compared to 2k% of the boys received the 
highest mark (1). Only 7% of the girls receive a 3 in conduct, euid 
none receives a k. For boys, 22^ receive a 3 and one boys receives 
a k. Boys clearly receive lower first conduct marks. 



Marks at End of First Grade 

There is more skewing in opposite directions by sex at the time the 
second mark in reading is given. Whereas at midyear I's were assigned 
about equally to boys and girls, at the end of the year about 6l% of the 
girls, compared to 36^ of the boys, receive the highest mark. There is 
as well some clumping of boys at the lower end of the distribution—more 
than 26% of the boys get the lowest two marks whereas only one girl 
receives a "3"> the lowest mark assigned to a girl. The association 
between marks in reading and sex Is moderately significant (x! = 9*07> 
p < .05) (see Table S.'+S). 

There is some slight superiority of girls in terms of arithmetic 
marks— all get I's or 2's, whereas about 10^ of the boys get 3's, but 
sex differences are small and non-slgniflceuit. 

In conduct boys again receive noticeably lower grades than girls. 
Whereas over 30^ of girls receive a "1", less than 20^ of boys receive 
a "1". In addition, only two girls get less than a "2" whereas almost 
one-third of the boys fall below a "2". Using only the categories "1", 
"2", and "3" for a comparison by sex, one finds a highly significant 
association (xf = 15.39) between sex and marks In conduct. The awarding 
of better conduct marks to girls continues a trend begun when first 
marks are given. The polarization by sex has Increased over the first 
year. On first marking l6^ of children received a "3" or less of whom 
about k% are girls. On second marking about 20^ receive a "3" or less 
of whom 2^0 are girls. 



Table 



Sex 



N&le 
Female 

Total 
Percent 



17 
23 



Reading Mark, Time 2 
2 3k Total Percent 



18 



11 
1 



1*0 32 12 1 
U7.1 37.6 lk,l 1.2 



hi 
38 

85 



55.3 
Uk.7 



100.0 
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Male 
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Total 
Percent 



17 
17 



Arithmetic Mark, Time 2 
2 3 Total Percent 



25 
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23 
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Mferks at Midyear, Grade 2 



By midyear second grade 22^ of boys get a "3" or "U" in reading , 
approximately duplicating the percentage seen at the end of first grade. 
The performance of girls is not as good at midyear second grade as it 
%Kt8 at the end of first grade however, because l6^ get 3's at this time 
(only 3^ got 3's at year end). The majority of girls get the highest 
mark, nonetheless, and clearly outshine boys in tezms of superior marks. 

Arithmetic marks also shov some interesting shifts between first 
and second grade. At first the modal mark for both sexes is a "2" and 
little use is made of the "3" category (U^ boys, girls). Then at the 
end of grade one, although the modal mark for both sexes continues to be 
a "2", substantial numbers of both boys and girls get I's (almost half 
the girls, and well over a third of boys) . Eleven per cent of the boys 
get 3's while no girls do. At the middle of second grade, there is a 
sharp drop. Only 1^ of boys and iQfji of girls get I's and hi^er per- 
centages of both sexes get 3's (20^ of boys and 2k% of girlsTI There is 
thus considerable difference between first and second grades in how 
teachers assign arithmetic marks. One might think of a "honeymoon" year, 
followed by a thumping return to reality. In the second grade the 
picture for conduct remains about the same as it was in first grade, with 
perhaps a little less leniency toward girls. 



Marks at the End of Second Grade 

At the end of grade two the sex difference in reading marks has 
diminished and there are relatively few marks given to either sex below 
a 2 (only 7^). Girls still outperform boys in terms of A-marks (66^ of 
girls get A's coopared to kl% of boys). 

In arithmetic boys are more variable in performance than girls. Over 
28^ of boys get A's, compared to 21^ of girls. But 17% of boys get 3's 
and k'B compared to 11^ of girls. The average mark by sex is 1.91 for 
boys compared to I.89 for girls but boys are considerably more variable. 

In conduct there is a strong trend for girls to get better marks. 
Over 60^ of girls get A's and only 8^ get C's (no D's). Boys, on the 
other hand, have a relatively large percentage getting C or D (20^) and 
only 2Bf]o receiving A's. 

To sum up ; Marks in reading show girls expending greater effort 
over the first two grades and receiving a disproportionate share of A's, 
starting from an approximately equal assignment of A's on the very first 
report card. Arithmetic marks show very little difference in average 
level by sex at any time, with the exception that boys' marks at times 
display more variability. In conduct girls outperform boys at every 
marking period, a finding which is not surprising. 
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Sex and Children's Expectations 



First Qrade 

There are no strong sex dlt'i'erenoes In expectations for reading 
before the first marks are received. Girls are a little more hopefU. for 
A*s rather than B's. There is an interesting pattern in initial expecta* 
tions for Gurlthmetic, perhaps reflecting sex role socialization. About 
21% of boys expect less than a B, compared to 2^ of girls but whereas 
no boy expects the lowest grade, Ti> of girls do. Thus a sllgjbtly higher 
percentage of girls expect a low grade and a few expect the worst. In 
conduct the expectations of the sexes are the sane, and rather trite. 

At the end of the first grade the expectations of both sexes are 
highly polarized at the high end for reading. The changes in arithmetic 
expectations are toward highly similar distributions for the sexes. In 
conduct 80^ of girls expect the highest grade compared to something less 
than half of the boys. Thiii expectation is veridical given the high 
percentage of A's in conduct assigned to girls. 



Second Grade 

In second grade high expectations continue to predominate for both 
sexes in reading and arithmetic, but there has been a noticeable 
moderation in reading expectations, the subject where children by now have 
the most feedback. At the end of first grade $8j( of the boys and 6?^ of 
the girls expected an A. In the middle of second grade kOjl, of boys and 
^0^ of girls expect an A in reading. In arithmetic children's expectations 
at the end of grade one and in the middle of grade two are hl^y similar 
and not much different by sex. The percentage of boys ejecting low 
conduct marks Increases from the end of first to the middle of second 
grade, form 17% to 26%, but the two sexes remain quite similar in terms 
of e^ec tat ions. 

By the end of second grade for the first time less them 30% of the 
boys expect an A in reading. Although a majority of the girls (53%) still 
look for an A in reading, this is a smaller majority ttian heretofore. 
The sexes still do not differ significantly in reading expectations. 
Likewise there has been considerable moderation in boys' esgpectations for 
arithmetic even though the modal category (U7%) is A. For girls there 
also is a noticeable moderation, the modal category (^0%) now being a B. 
The sexes again appecur quite similar. In conduct there are slightly 
fewer low esqpectations for both sexes at the end of second grade. Once 
again the sexes remain similar. 

In arithmetic children's expectations at successive time points 
continuously have some low. cases (at the end of second grade 11% of the 
boys and % of girlo %till look for the lowest two grades). This same 
trend can be seen in every earlier table for arithmetic expectations- 
some children are pessimistic. There seems to be much less pessimism for 
reading— only 3% of the girls and 7% of boys look for a 3 or less in 
reading at the end of second grade. At the end of first grade 2% of 
girls and U% of boys looked for less than a "B" in reading, whereas 13% 
of boys and 12% of girls looked for low marks in arithmetic at that tijne. 

erJc : : : ; ^ " ^ 



With small numbers of cases conclusions are not firm but both teachers 
and students seem to define performance very differently in reading and 
arithmetic. The difiTerent^ and more rigorous, marking standards for 
arithmetic have earlier been noted, as has the negative association 
between e3g;>ectation8 In the two areas. 

There is of course considerable tolerance in everjrday life for 
persons who are "poor" at arithmetic but relatively little tolerance for 
failures in reading. Some persons, In fact, announce with resignation, 
or even pride, that they "cannot add 1 and 1" but one never hears 
acknowledgment of difficulty in reading, it may be that children in 
this study reflect the mores of the larger society in their scaling down 
of hopes for achievement in arithmetic and in their openness in acknowl- 
edging a low level of aspiration. They do not seem to have the same 
attitude toward reading achievement. 

As a general summary statement the most c onsistent observation is 
that the sexes do not differ in expectations over the two year period 
observed (conduct at time 2 is the only exception). 



Sex and Parents' Expectations 

Parents have very sllglitly higher expectations for girls' initial 
reading marks than for boys— about l6^ of boys are expected to receive 
less than a "B" compared to 6^ of girls. Since fewer boys than girls 
expect higher marks the degree of congruence between parents and children 
of both sexes is reasonably similar (36^ and 23% matching for boys and 
girls respectively). 

Surprisingly, there is only a slight margin in parents* expectations 
for high arithmetic marks favoring boys (30^ vs. 23% expect A's for boys 
and girls respectively). There is less discrepance, however, between the 
sexes in a coiqparison of parents* and children's esgpectations for arith- 
metic than was the case for reading. About 28^ of girls ' parents have 
expectations that match their daughters' eogpectations for arithmetic 
compared to 32^ of parents having expectations that match their sons'. 

Parents ' xpectations for conduct are different for the two sexes- 
girls are expected to behave better than boys. Whereas 32^ of girls' 
parents look for the highest grade in conduct, only 10^ of boys' parents 
expect this. Rather surprisingly there is about the same low degree of 
congruence between parents' and children's conduct expectations by sex, 
with 37% matches for boys and 33% matches for girls. 

Parents seem to display for each sex no more than chance matching 
of esgpectations— as one might have expected given that parents' expectations 
do not match children's expectations in general (see the discussion of 
parents* expectations) . 

In sum, parents do not seem to hold different expectations for boys 
and girls except possibly in conduct. 
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8ex« Mfetrks and Parents Expectations 



In terms of actual marks received in reading, all but 13$ of boys 
do as well or better than their parents ea^ect in reading. The figure 
for girls is 11^. What parents expect and marks actually received by 
the sexes are coiqparable. 

In arithmetic Just under 2% of the boys do worse than parents 
expected, while this is true for only 13$ of girls. The distributions of 
marks as awarded are similar for boys and for girls but as mentioned above, 
parents do seem to have slightly higher hopes for boys than for girls In 
arithmetic. Overall, though, it seems that the sexes are conoparable in 
the relations displayed between arithmetic marks and parents' eacpectations. 

Parents e:q?ect boys to be less well-behaved than girls (as mentioned, 
33% of girls and 10% of boys are expected to get A* s in conduct) but 
teachers award many more high marks to girls in conduct than to boys {6l% 
vs. 26% get A's). The congruence between parents' expectations and 
marks received in conduct turns out to be roughly equivalent for the two 
sexes because the marks awarded are similarly Inibalanced by sex. 

In sum the relationship between parents' expectations and metrks in 
all three areas are reasonably conoparable between the sexes despite sex 
differences in marks and moderate differences in expectations. 



Sex« Marks and Children's Expectations 

Girls are more optimistic than boys about the first m»irk in reading 
(68^ vs. 52^ expect A's) and teachers g1y« a few more A's— and noticeably 
more C's— to boys in reading on the first report card. Over 18$ of boys 
/f*«^?'® conpared to a single girl (k%) getting a C. Almost half the girls 
W%) get a lover mark in reading than they expect, and a leurge percentage 
of boys (39%) pIsc get a mark lower them they expect. What is striking, 
though, is that about 25% of the boys who expect an "A" actually get a 
"C"— this does not happen to girls as frequently (4%). 

In arithmetic similar findings prevail in terms of marks coBpared to 
expectations— half the boys get a lower mark in arithmetic than they 
expected and 39$ of girls do. There is no marked increase in low marks 
for boys as occiirred for reading however. 

In conduct Leys and girls ar^ very optimistic, but the girls' 
optimism is more fully supported by later events than the boys ' . Teachers 
give only 8% of girls a conduot mark below a "B" compared to 23% of boys. 
With this almost a third of the girls get a higher rnark in conduct than 
they expected whereas only about l8% of boys get a higher mark than 
expected. 

The chance levels of matching between first marks and expectations 
observed in the whole conort are evident in all three areas for both boys 
and girls. 
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Relations with I.Q. 



While a significant relationship between I.Q. (FM/l) and reading is 
present at the middle of first grade (r s ,22h, p < .05) it is not until 
the middle of second grade that significant relationships betw een I.Q. 
and marks In reading and arithmetic again reappear (r = .273, P < .01 and 
•37^, P < .01, respectively). Oliese relations fall off (to non- significance 
In the case of reading) at the end of second grade, however, whether the 
early (I^) or later (SFTAA from STm) I.Q. measure Is used (FMA: r = 
.lU5, N.8., r = .196, p < .05, and SFPAA: r = .131, N.S., r = .201, p < .05 
for reading and arithmetic respectively). There is a sizeable correlation 
(.655) between I.Q. measured In first and second grades using the two 
different tests. 

Because the marks are not a continuously distributed variable, and 
because the large majority of cases fall in only the two uppermost mark 
categories, the relationships between I.Q. and marks may be attenuated 
to a considerable extent. 

If qiiartiles are formed on the I.Q. distribution (the necessary cut 
points being 12U,5, II6.5, and 109-5) there Is little difference in the 
distributions of first reading marks within the two middle categories 
and there are moderate differences within the two extreme categories. 
Of the 19 cases in the upper I.Q. quart lie 58^ get I's and U2^ get a 
lower mark. Of the 20 cases in the lowest quartlle 20^ get I's and 80^ 
get a lower mark. To give some notion of the lack of relation between 
marks and I.Q. , at the end of first grade those children receiving the 
highest mark ("1") In reading, come from the different quartlles (from 
highest to lowest) In the following proportions: .2k, .lU, .38, ,2k, 
Th« picture is very similar In arithmetic— .21, .12, .^3, ,2k. 

By the middle of second grade, there is a little more consistency 
in terms of I.Q. quartlle and mark received. This is most noticeable in 
arithmetic where of those children receiving a "1" in arithmetic, the 
quartlle percentages are as follows: U6^, 1%, 13%, 23$, Of those 
children receiving a "3" In arithmetic, the queu^tlle percentages are: 
0^, l%y 2k%, 59^. At the end of second grade the quartlle frequencies 
are less "consistent" than they were in the middle of the second grade 
year— reflecting the decline in correlation. 

The agreement between I.Q. quartlles for grades l(FI4A) and 2 
(SFTAA) is best at the high extreme (see Table 3-52). About 86^ of those 
classified in the highest quartlle in first grade are classified the same 
way again, and the remaining lk% are reclassified into the second quartile. 
Of those classified in the lowest quartile in grade 1, a little less than 
half {k&^) receive the same quartile rating in grade 2, but have 
jumped to the top quartile. But of those classified in the third quartile 
in grade 1 — the quartile that shows the greatest discrepance upon retesting 
in grade 2— over a third have jumped to the top quartlle by grade 2 and 
1% have slipped to the lowest quartlle at the time of second testing. 

These distributions illustrate a fact well-known but little 
appreciated: with young children high I.Q. scores are more apt to be 
accurate than low ones. After all, there ic almost no way that a child 



Table 3.^8 



Reading Mark, Time 1 
12 3 Total Percent 





Ist » High 


11 


6 


2 


19 


23.2 


Categorized 


2nd 


5 


11 


3 


19 


23.2 


I.Q. 














(PMA) 


3rd 


10 


Ik 




2lf 


29.3 


in Grade 1 
















Uth - Low 


k 


11 


5 


20 


2k.k 




Total 


30 


k2 


10 


62 






Percent 


36.6 


51.2 


12.2 




100.0 
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Table 3*^9 



Reading Mark, Tine 3 







1 


2 


3 


k 


Total 


Percent 




let s High 


0 


5 


1 




Ik 


17.7 


Categorized 
I.Q. 
(PMA) 
in Grade 1 


2nd 
3rd 


6 
9 


9 
10 


3 


1 


19 
23 


2lf.l 
29.1 




Uth = Low 


5 


12 


3 


3 


23 


29.1 




Total 


28 


36 


U 


k 


79 






Percent 


35A 


U5.6 


13.9 


5.1 




100.0 



Table 3.50 

Arithmetic Mark, Time 3 
123 Total Percent 



1st = High 


6 


8 




Ik 


17.5 


Categorized g^id 
I.Q. 

in Grade 1 


2 
2 


Ik 
17 


3 
k 


19 
23 


23.7 
28.7 


kth = Low 


3 


11 


10 


2k 


30.0 


Total 


13 


50 


17 


80 




Percent 


16.2 


62.5 


21.2 




100.0 
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Table 3.31 



Arithmetic Vtark, Time k 





1 


2 


3 


if 


Total 


Percent 


(SPTAA) 2nd 

in Grade 2* « . 

3rd 


11 

3 
6 


13 
22 
15 


k 
1 




29 
22 


26.4 
21.6 


Uth s Low 


3 


Ik 


6 


1 


24 


23.5 


Total 


23 


6k 


14 


1 


102 




Percent 


22.5 


62.7 


13.7 


1.0 




100.0 


* (cut points at 111.5, 103.5, and 9k.^) 










Table 3*52 








Categorized I.Q. 


(SFTAA) in Grade 2 






High = 
1st 


2nd 


3rd 


Low » 
4th 


Total 


Percent 


let s High 

Categorized 

I.Q. 2nd 

(PMA) 

in Grade 1 3rd 


12 

5 
8 


2 
10 

7 


3 
5 


1 
3 


14 
19 

"23 


17.5 
23.7 
28.7 


4th « Low 


2 


1+ 


r 


11 


24 


30.0 


Total 


27 


23 


15 


15 


80 




Percent 


33.7 


28.7 


18.7 


18.7 




100.0 
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can score much higher than his ability* but there are innumerable ways 
he can score lower (misunderstanding directionsi inattention, anxiety, 
poor vision, etc. ) . 

While the correlation between I.Q. measures at the two times is 
sizeable {»653) it is not as high as would be preferred for a retest. 
With this low a reliability, true score I.Q. -mark correlations may have 
some difficulty displaying themselves. 

It should be noted in the interpretations of the above results that 
the marking policy of the middle-class school (marking students in terms 
of their own ability) has the effect that teachers are atteoptlng to 
partial out I.Q., or an I.Q. like coqponent, from the marks they assign. 
To the exti^nt they are successful I.Q. -mark correlations will be 
diminished. The presence of some significant I.Q. -mark correlations 
Indicates teachers are not botally successful at partialllng out ability— 
at least "ability" as measured by the I.Q. tests the teachers had access 
to— but the extent to which they are successful remains largely unknown. 



Relation Between Self-Esteem Test and Expectations 



The self-esteem test developed by Dicksteln (1972) has already been 
described in Chapter 2. The reader will recall that It has two forms, 
one for boys and one for girls. The correlations that follow are generally 
based on from Uo to k3 boys euid 31 to 37 girls. 



Boys 

No correlations of any of the three self-esteem factors and marks 
received at midyear first grade are significant at the % level for boys, 
although all are positive (9 coefficients). Factor II (athletic image) 
does correlate in excess of .20 with all three marks. 

Of the nine correlations between boys' midyear (grade one) expectations 
in the three areas and the self-esteem factors, the only correlation that 
attains significance (p < .03) is that between conduct expectation and 
factor III (student Image). For boys, expecting a good conduct mark 
correlates positively with holding a good student Image (.269)— whiuu 
Is not too surprising. It is more surprising that the student Image 
factor does not correlate significantly with academic expectations (.213 
and .072 for reading and arithmetic respectively). 

At the end of first grade there is a highly significant correlation 
between factor III (student image) and boy's mark in reading (r = »kOh, 
p < .01), and a moderately significant correlation with boys' mark in 
arithmetic (.267, P < .0^). Student image also correlates signifioemtly 
with arithmetic eacpectations at year end (.25I+, P < .05) but not with 
reading expectation (.182, N.S.). Factor I (good citizen image) correlates 
with boys' expected mark in conduct (r ^ .3^1, P < .05) at year end. 
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There is no signifloeuit relation between I.Q. and the three self- 
esteem factors though all are positive and factor I (good citizen iioage) 
comes very close to significance {r ~ .2^7). 

There is a significant correlation between the scores on factor I 
as measured in first grade and factor I as measured in second grade— a 
12 to 15 month time span (r s .357, p < .05). Ttie over- time correlation 
for factor II is much more substantial— .520— and also highly signifi- 
cant (p < .01) as is factor III (.^37). 

Factor III, the student image as measured in grade 2, again Shews 
a significant correlation with the marks received by boys in reading 
(r = .285, p < .059 n = 53) and the marks received in arithmetic (r s .297) 
p < .05). In addition, feustor I (good citizen image) shows fairly 
sizeable positive correlations with all three marks at the end of grade 2, 
with those for reading and conduct being significant (.276 and .310, 
respectively) . 



Girls 

Ukie same kind of relationships between self-esteem factors emd 
school variables were tested for girls. 

At first marking (midyear, first grade) there were no discernible 
relationships between any of the self-esteem factors and girls' marks in 
the three criterion subjects. At the end of first grade, aom sizeable 
relationships began to surface. Factor II (athletic image) tiien 
correlates with both reading and arithmetic marks (r = -.3359 P < .05 
and '.kOk, p < .05) and factor III (social Image) correlates with 
arithmetic (r = ',k03t P < .05) but these are all in the opposite direction 
compared to boys . For girls, low self-exteem on the factors correlates 
with high marks J The non- significant correlations at the end of grade 
one are mixed as to their sign. Hhere is only one sigi if leant 
correlation between girls' expectations and self-esteem factors at the 
middle of first grade, that of athletic Image (factor II) and reading 
expectation (-.308, p < .05). It is in the direction of high self-esteem 
being associated with a low esgpectation. At the end of first grade 
factor II (athletic image) continues to correlate in this fashion with 
reading expectations (r = -.3259 P < .05). At both the siiddle and the 
end of grade one the non- significant correlations are mixed as to sign. 

As far as I.Q. is concerned, there is no significant association 
between it and any of the self-esteem factors early in first grade. 

There is no association between scores on factor I (good citizen image) 
as measured early in first grade and as measured at the end of second 
grade. There is a similar lack of correlation for factor III (social image). 
But there is a significant (r » .381, p < .05) association for factor II 
(athletic Image). At the end of second grade, there are still no 
significant correlations between self-esteem (as measured in grade 2) and 
any marks for girls. 
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In summary boys' and girls' marks and expectations display quite 
different relations to the various self-esteem factors— Including changes 
In both sign and magnitude of correlations. 
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CHAPTER h 
MIDDLE-CLASS SCHOOL, COHORT 2 
( STARTING FIRST GRADE IN SEPT. 1972^ 



A second cohort at the middle-class school was followed starting 
in September, 1972, consisting of all children beginning first grade in 
that school at that time. Complete data for these children cover the 
school year 1972- 73 9 although some prior data in terms of kindergarten 
teachers' forecasts were obtained. These children are presently being 
followed in second grade and results will be available in future 
reports. They have demographic characteristics like those for Cohort 1 
in the same school (see Table 3 •I)* 

This chapter is short because most of the data pertaining to Cohort 
2 are given in Chapter 3 (combined with data for Cohort 1). In this 
chapter only a few points will be covered where there seem to be differ- 
ences between cohorts or where the details of cohort -to- cohort cosqparisons 
are illuminating. 



Tabular Summary 



In reading parents and children esgpect virtually the same performance 
(1.80 vs. 1.79 respectively). In arithmetic parents are less optimistic 
than children (2.03 vs. 1.9^9 both are somewhat more guarded about 
arithmetic as compared to reading. For conduct parents are more 
optimistic than children (1.97 vs. 2.1?). As with Cohort 1, the 
variability of children's forecasts exceeds parents'. Cozqpared with 
Cohort 1, parents look very much the same (their expectations ranging 
from 1.80 to 2.03 in Cohort 2 vs. 1.83 to I.96 in Cohort 1). Children 
of Cohort 2 are less optimistic than children of Cohort 1 (ranging 
from 1.79 to 2.12 vs. I.63 to 1.68^. (See Table 5-1 for a complete 
summary) . 



Discrepances Between Initial Expectations 
and First Report Card 



Reading 

The teachers, first of all, give almost all A's and B's at the first 
marking opportunity; of the children get B*s and 3^+^ get A's. The 
children, by contrast, expect many more A's (over half of them expect A's) 
and many more oxpocb a relatively low mark than aqtually are awarded low 



T&ble k.l 



Reading mark, Tine 1 







1 


2 




XOu&X 


^&W1 ^ 




1 


15 


28 


3 


U6 


51.1 


Reading, 


2 


11 


9 


2 


22 


2U.U 


Expectation, 
Time 1 


3 


3 


11 


3 


17 


18.9 




h 


2 


2 


1 


5 


5.6 




Total 


31 


50 


9 


90 






Percent 


3U.1* 


55.6 


10.0 




100.0 



Table U.2 



Arithmetic Mark, Time 1 







1 


2 


3 


Total 


Percent 




1 


k 


18 


3 


25 


27.8 


Arithmetic 


2 


12 


28 


9 


U9 


54.1* 


Expectation, 


> 








Time 1 


3 


1 


8 


2 


11 


12.2 




k 




2 


3 


5 


5.6 




Total 


17 


56 


17 


90 






Percent 


18.9 


62.2 


18.9 




100.0 
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Table U.3 



Conduct Mark, Time 1 







1 


2 


o 


). 




Percent 




1 


10 


21 


U 




35 


38.9 


Conduct 


2 


h 


8 


6 




18 


20.0 


Eigpectation, 












Time 1 


3 


6 


16 


5 




27 


30.0 




k 


2 


6 


1 


1 


10 


11.1 




Total 


22 


51 


16 


1 


90 






Percent 


2k.k 


56.7 


17.8 


1.1 




100.0 
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marks. About 2% expect a C or D while only 10^ get C's, and none 
lower. The greatest difference between this Cohort and Cohort 1 lies 
here— for only 10^ of the children in Cohort 1 eacpected a C or a D in 
reading initially. The predicted numbers of matches between expectations 
and marks in reading, 29.77* is close to the number of matches, observed, 
27, so the extert of of matching between Initial expectation and marks 
in reading is random (Table U.l). 

It is interesting to examine the 2X2 array in the upper left 
comer of Table k,l where more than two-thirds of the cases fall. 
Children's expectations for A's and B's are not consonant with marks 
received— of the large numbers (43) who expect A's, the majority 
receive B's. Of those expecting B's (20) the majority receive A's. This 
situation almost exactly duplicates what was seen in Cohort 1. 



Arithmetic 

Children in Cohort 2 are considerably less optimistic about 
arithmetic than children in Cohort 1. Although ^ look for A's and 
B's, only 2^ look for an A. This compares with 89^ looking for A's or 
B's in Cohort 1, with 6l% looking for an A. CdOipared on escpectations 
for reading children of Cohort 2 are much more guarded in arithmetic, 
while children of Cohort 1 were more optimistic about arithmetic. Also, 
there is a different division between A's and B's in the different 
substantive areas, in arithmetic more than 30^ look for B's whereas in 
reading the modal (and majority) expectation was for an A. 

Teachers do waxk arithmetic somewhat more severely, because in 
Cohort 2 12^ get C's whereas in Cohort 1 only received C's. 

The amount of matching between children's expectations and marks 
is within sampling error of the 37.29 matches predicted by chance {3k 
matches are observed). There is also a fairly even division between 
those who get a higher grade than expected and those ^o get a lower 
grade than expected. 



Conduct 

Ttie picture here is not particularly interesting. The amount of 
matching {2k) is well within chance expectancy (23.7), there is no trend 
for e3Q)ectation8 to exceed markt< or the reverse, but teeichers give notice- 
ably fewer A's in conduct to Cohort 2 {2k%) than to Cohort 1 (^3^). Also, 
interestingly, childi'en in Cohort 2 have notlaeably lower eaqpectations 
for conduct (59?^ look for A's or B's conparei to 87^ in Cohort l). 



Kindergarten Teachers' Forecasts 



Since teachers' expectations are often assumed to influence future 
performance, it is of interest to see how well kindergarten teachers' 
forecasts for performance in first frrade are borne out. These data are 
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Table U,k 



Reading 
Mark, 
Time 1 



Kindergarten Teacher's 
Reading Escpectation 

1234 Total Percent 
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3 


16 


3 




22 


36.1 


2 


10 


12 


9 
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35 


57.4 


3 


1 


2 


1 




k ■ 


6.6 


Total 


Ik 


30 


13 


k 


61 




Percent 


23.0 


h9.2 


21.3 


6.6 




100.0 
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available only for Cohort 2. Forecasts were obtained by asking kinder- 
garten teachers to guess how well children would do in first grade. Bie 
guesses were obtained at the end of kindergarten, Just before school 
closed, after the teacher had spent 8 to 9 months in fairly close 
association with the children. 

The forecasts are far off the mark. There is less congruence than 
would be expected by chance. Whereas 23.11 exact matches would be 
predicted, only l6 are noted (z = 2.0k, p < .05). Also the kindergarten 
teachers are considerably less optimistic than the children's performance 
warrants, and also less optimistic than parents are. They look for 
about 72% to get A's and B's, and over 93% of the children actually do 
so. Also 32% of parents look for A's whereas fewer {23%) of kindergarten 
teachers do. The kindergarten teachers' lack of optimism is shown in the 
departure from symmetry around the main diagonal (X = 7.20, p < .01) 
brought about because more often than not, when expectations and marks 
received do not agree, it is the teacher's expectation which is lower. 
The kindergarten teachers seem to be less aware of the grading 
distributions used by the first grade teachers than parents are, for 
parents' marginals looked very much like the actual distribution of 
marks awarded in reading whereas kindergarten teachers expected 2&ji, to 
get C's or D's compared to 7% actually receiving these low marks. 



What is the Correspondence Between Marks 
from Midyear to Year End? 



Reading 

In reading the tea-chers' marginals are fairly stable from one 
marking period to the next for Cohort 2. If anything a little more 
leniency appears as the year goes on. Over 91^ get A's and B's on second 
marking. In 62% of the cases the mark awarded stays the same— a degree 
of over-matching that is highly significant (p < .01). There are about 
equal numbers of children whose marks decline as there are those whose 
marks improve, m Cohort 1, on the other hand, teachers are noticeably 
more generous with A's in reading at the end of the year compared to the 
midyear {h7% vs. 33%). This leads to a smaller percentage of matching 
marks {3%) on the two occasions for Cohort 1 thcui is the case for Cohort 
2. 



Arithmetic 

In arithmetic the picture is more complex. There is a high degree 
of matching of arithmetic marks over the year {67%). Hhia is significantly 
(p < .01) above the number of matches expected by chance {h2%). Teachers 
tend to assign more A's at year end (36^ vs. 18^) and fewer C's or D's 
(11^ vs. 19^). In Cohort 1, more A's were assigned at year end {kl% vs. 
22%) and there were the same niunber of c's {%) both times. 
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Table 



Reading Mark, Time 2 

123 Total Percent 

1 21 10 31 33.3 

Reading 

Mark, 2 9 36 7 52 55.9 

Time 1 

3 9 1 10 10.8 

Total 30 55 8 93 
Percent 32.3 59.1 8.6 100.0 



Table U.6 

Arithmetic Mark, Time 2 
1 2 3 U Total Percent 
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1 




17 


18.3 
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ko 
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1 


58 


62.U 


Mark, 
















Time 1 
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2 


0 


7 




18 


19.^ 




TotaJ. 


33 


50 


9 


1 


93 






1 

Percent 


35.5 


53.8 


9.7 


1.1 




100.0 



Table k,7 

Conduct Mark, Time 2 
12 3 4 Total Percent 
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2 
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23.7 
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37 
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1 




58.1 


Mark, 












Time 1 


3 
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10 


5 




16 


17.2 
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1 


1.1 




Total 




h9 


9 


1 


93 






Percent 


36.6 


52.7 


9.7 


1.1 




100.0 
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In Cohort 2 the degree of leniency for arithmetic marks as the year 
progresses looks about eqml to that observed for reading»8l^ get A's 
and B's at first mark and 89^ do so at second mark. There is however, 
a very different pattern around the main diagonal. Whereas slightly 
over of children's marks in arithmetic go down, 31^ go up. There 
are, in addition two children whose marks fall by 2 notches and 2 whose 
marks rise by two notches. Because the teachers' grading distributions 
are initially "harder" in arithmetic, they can award twice as many A's 
at the second mark as they do at the first. In reading, by contrast, the 
same proportion of A's is awarded on both occasions. 



Conduct 

Ihe pattern over time for marks in conduct closely resembles that 
for arithmetic. Conduct also displays a significant over-natchlng of 
marks at midyear and year end (see Table U.7 ). 

It is rather surprising that teachers' marks differ as much as they 
do from Cohort 1 to Cohort 2. 



Parents' Expectations 



At the beginning of this chapter, data on parents' expectations 
were reviewed (See also Table ^.1) and it was pointed out that parents 
of children In Cohort 2 have about the same expectations as parents of 
children In Cohort 1. 



Parents' Expectations Compared 
to Children's Esgpectations 



Reading 

!nie degree of congruence between the expectations of parents and 
children is only what would be predicted In terms of chance. The eogoected 
number of matches is 23, with a standard deviation of 3.^* In 19 out of 
73 cases (26^) for reading there Is perfect agreement (in Cohort 1 the 
comparable figure is 31^), well within chance expectation. Unlike 
Cohort 1 where children were noticeably more optimistic than parents, in 
Cohort 2 there are equal numbers of disagreements In both directions. 
In 27 instances parents are less optimistic than children and in 27 
Instances parents were more optimistic. The patterns of disagreement, 
however. In the optimistic and pessimistic directions are very different. 

In almost all of the Instances where parents are less optimistic 
than children, the child looks for a 1 while the parent looks for a 2 
(22 of 27 instances). The marginals for parents and children are not 
alike. Over 50^ of children look for the highest grade while only about 



Table U.8 



Child's 
Ib^ectation, 
Reading, 
Tiae 1 





Parent ' s 
1 


E^ectation, 
2 3 


Reading Time 1 
k Total 


1 


11 


22 


k 




37 


2 


9 


7 






16 


3 


k 


10 


1 


1 


16 


U 


1 


1 


2 




k 


Total 


25 


ko 


7 


1 


73 


Percent 


3U.2 




9.6 


1.1* 





Percent 

50.7 
21.9 

21.9 

5.5 

100.0 



Table U.9 

Parent's Escpectation, Arithmetic, Time 1 

1 2 3 U Total Percent 



• 


1 


8 


12 


3 




23 


31.5 


Child's 
E^ectation, 
Arithmetic, 
Time 1 


2 
3 


8 


22 
5 


9 
2 




39 
7 


53.U 
9.6 




k 




2 




2 




5.5 




Total 


16 


hi 


Ik 


2 


73 




Percent 


21.9 


56.2 


19.2 


2.7 




100.0 
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Table If. 10 



Parent's E3(pectatlon» Conduct, Tine 1 



Child's 
Expectation, 
Conduct, 
Time 1 





1 


2 




k 






1 


5 


16 


k 


1 


26 


35.6 


2 


6 


8 


2 




16 


21.9 


3 


5 


15 


3 


1 


2lf 


32.9 


If 


1 


If 


2 




7 


9.6 


Total 


17 


k3 


11 


2 


73 




Percent 


23.3 


58.9 


15.1 


2.7 




100.0 
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one- third of parents do. Similarly a large percentage (3^) of parents 
have modestly optimistic expectations— a "2"— but 22% of children hold 
this expectation. Over 2?^ of children look for a 3 or a 4, compared to 
11^ of parents. Children's expectations are thus considerably more 
variable than parents ' . Children of both Cohort 1 and Cohort 2 hold 
more extreme expectations than parents. In Cohort 2 the variability 
operates in both directions (some children e:q>ect very low grades) 
whereas In Cohort 1 there was over-optimism only. 



Arithmetic 

There is more congruence between parents' euid children's expectations 
for arithmetic (4?^ exact agreement) than was true for reading, but again 
a matching test indicates this amount of agreement is not significant 
given the marginals. What appears to be greater congruence stems from 
the similar marginals of the two groups. Parents and children look for 
the same number of 2's (56^ and 3% respectively) and the only major 
difference between them is an imbalance between I's and 3's. About 31^ 
of children es^ect I's compared to 22^ of parents. About twice as many 
parents look for 3's as children (19^ va. 10^). Somewhat more pessimism 
is registered by parents than by children (33^ of parents are less 
optimistic about arithmetic than their child, coi^pared to 21^ being more 
optimistic^ but the differential optimism is not extensive enough to be 
statistically significant (x^ « 2.08, N.S.). 

A difference between parents in Cohort 1 and those of Cohort 2 is 
that parents of Cohort 2 differ in their marginals for reading and 
arithmetic, expecting higher reading grades. Parents in Cohort 1 held 
about the same expectations in each area. 



Conduct 

Expectations for conduct are remarkable both because of children's 
pessimism— U3^ of them expect a "3" or a "1+"— and because of what appears 
to be a lower degree of congruence between parents and children (22%) 
than was true for arithmetic. Actually given the two marginal distributions, 
the degree of congruence is well within the range of what would be 
predicted by chance. Again parents pick the "safe" alternative, a "2", 
and again children resort to the extremes of the distribution more often. 
As well as many expectations for 3's and 4's, about 36% of the children 
look for I's. The asymmetry in the table points toward children being 
less optimistic than parents but this imbalance is not statistic«.lly 
significant (X = 1.12, N.S. ). 



Difscrepance Between Parenus' Initial E:Q>ectation 
and First Report Card 

Reading 



There is remarkable overall concordance at the aggregate level 
between parents' marginal distribution and the distribution of teachers' 



Table k,ll 



Reading MB^k, Time 1 
12 3 Total 
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Eag^ectatlon, 
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Time 1 
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Table U.12 

Arithmetic Nark, Time 1 
12 3 Total Percent 
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2 


3 


39 


Arithmetic 










lit 




Time 1 


3 


2 


6 


6 


19.7 




k 






2 


2 


2.8 




Total 


16 


U3 


12 


71 






Percent 


22.5 


60.6 


16.9 




100.0 
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Table If. 13 

Conduct Mark, Time 1 
123 Total Percent 





1 


5 


10 


1 


16 


22.3 


Parent ' s 
Expectation, 


2 


8 


27 


8 


h3 


60.6 


Conduct, 












lU.l 


Time 1 


3 


2 


5 


3 


10 




k 




1 


1 


2 


2.8 




Total 


15 


h3 


13 


71 






Percent 


21.1 


60.6 


18.3 




100.0 
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marks— the same number of l*s are awarded as parents estpect, no U's are 
given as parents expect, and the numbers of 2's and 3's awarded are close 
in the two distributions. In other words the pattern of marks awarded 
conforms with what parents overall expect. On an individual level, 
however, as particular parents and children are matched, there is not 
much agreement between expectations of parents and children's first 
reading mark. There is a little more, but not significantly more, 
concordance between parents expectations for reading marks and children's 
performance than would be expected by chance (35 cases (out of 71) of 
agreement— Just about 50?^— where the expected agreement is 31.63). Thus 
23 parents look for A's and 23 children are awarded A's, but in only 8 
instances do the awards Euid parents' hopes for A's coincide. 



Arithmetic 

There is concordance at the aggregate level for arithmetic marks 
also, the marginals for parents' expectations looking very much like the 
marginals for marks awarded. At the individual level, over 50^ of the 
expectations are exactly matched by the marks awarded— as was the case 
In reading— but since the number expected under the matching model is 
29*39 for arithmetic with a standard error of 3* 51* the observed value 
of 36 only approaches significance (z = 1.68, p < .10). There is thus 
slightly more agreement between what par.^nts expect and what happens in 
arithmetic. 



Conduct 

In conduct the concordance between parents' e3^ectations and marks 
received is of the same variety as in reading or arithmetic and not 
significantly different from chance. The mark and parent expectation 
marginals are again very similar. 

In summary , the children of Cohorts 1 and 2 hold somewhat different 
expectations initially and teachers' marks for Cohort 2 seem more severe, 
especially In reading. Parents' expectations are very similar from 
cohort to cohort, and this leads to a mild disagreement between expecta- 
tions of parents and children in Cohort 1 but not in Cohort 2. 
Surprisingly, kindergarten teachers' forecasts are poor predictors of 
performance In first grade, significantly worse than chance level. 
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CHAPTER ? 

RESULTS FOR TWO FIRST-GRADE COHORTS (1971-72, 1972-73), 
WHITE MIDOLE-CLASS SUBURBAN SCHOOL 



This analysis permits a sunming up of findings present in the two 
white middle-class suburban cohorts, and In some Instances, an opportunity 
to look for effects that were too small or too cooplex to be clearly 
shown in the separate cohorts. As mentioned, we have assumed time-of- 
sampling to be negligible, that is, we assume differences between 
cohorts reflect differences in groups of children making up the cohorts 
rather than differences attributable to time- of- sampling— fall, 1971 
vs. fall, 1972. If results for the two cohorts conflict, our conclusion 
will be that the findings are uninterpretable with respect to develop- 
mental change. Where findings agree, data have been combined so 
associations and trends can be more reliably measured, and we assume 
that such findings do represent developmental trends. 



Tabular Summary 

Means and standard deviations for the variables used in subsequent 
analyses and cross- tabulations are given in Table 5.1. The mean IQ for 
183 children (those in the sample, Q2% for whom IQ data were available 
from school records) is 113. 5> with a standard deviation of 11. 

A number of conclusions, borne out in both cohorts separately, are 
even clearer in the data for combined cohorts. 

(1) Expectations were obtainable from 86^ of parents. All were 
contacted at least twice. Parents have lower expectations than children 
for reading and arithmetic, but only the difference for reading approaches 
significance (t = 1.93 for the combined cohorts). Parents' expectation 
levels are uniform across subject areas when the two cohorts are combined. 

(2) Children in Cohorts 1 and 2 show significant differences in 
their expectations, those in the second cohort being lower. For reading 
the difference (0.I6) is small in absolute terms and not significant 

(t = 1.17). For arithmetic the difference is still fairly small (O.28, 
about a quarter of a grade point) but significant (t = 2.26, p < .05). 
For conduct the difference is largest, being 0.U5, and this difference 
is highly significant (p < .01). The differences between cohorts are 
such as to make children's and parent's expectations more consistent in 
Cohort 2 than in Cohort 1. 

(3) At the middle of g^'ade one children have highest expectations 
in reading; next highest in arithmetic, and least in conduct. These 
differences between subject areas are consistent in both cohorts, but 
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only the reading-conduct difference reaches significance (p < .05) in 
the combined cohorts. ' Dirt'erences between areas are tiny in Cohort 1. 

(k) Children's expectations in reading and conduct increase 
significantly over the first-grade year (t = 2.60, p < .05; t = 3.60, 
p < .01, respectively for the combined cohorts). Those for arithmetic 
increase also, but not to a statistically significant degree (t - 1.74, 
p < .10 for the combined cohort). The smaller increase for arithmetic 
escpectations results in the children's arithmetic expectations being 
the lowest of their expectations at the end of first grade while 
conduct expectation was lowest in the middle of the first grade. 
Both cohorts individually show the same pattern of relationships, 
the only difference being that the second cohort shows a more pro- 
nounced increase in arithmetic expectations between midyear and year 
end than does the first cohort. 

(5) Children's narks go up significantly in arithmetic over the 
year in both cohorts (Cohort 1, +0.16, t = 3.20, p < .01; Cohort 2, 
+0.2U, t = 3.75, P < .01) as well as in the combined cohort (+0.21, 
t s= U.93, p < .01). Reading marks remain relatively stable over the 
year in both cohorts. Conduct moves down in the first cohort, up in 
the second and for the two combined is almost stable. 



Before the First Report Card What Do Children Expect? 

First, a word is needed about how marks are assigned. At the first 
marking period (around the end of November) first graders do not receive 
report cards. At the end of the first semester (around February 1) 
they receive a report. The card says: "This report is designed to 
measure the progress of your child in terms of his own maturity and 
ability. Comparison with other children or groups of children is 
avoided." Thus childrexi being Judged in terms of what they are 
theoretically capable of doing-- their performcuice, in other words, is 
assessed with IQ partialled out. 



Discrepaiicies Between Initial Escpectations 
and the First Report Card 

As was true for each cohort taken separately there is not a 
significant degree of matching in the combined cohorts between child- 
ren's expectations and the marks they actually receive on their first 
report card in reading, arithmetic, or conduct. (As can be seen in 
Table 5.1 the variability in expectations exceeds that in marks.) 
A high percentage of children expect A's (55^ eacpect A's in reading, 
for exai^ple) and also a sizeable percentage (6^) expect failure. 
Actually no one fails, (in conduct 2 children are given failing grades, 
whereas 15 {9^) expect tnem. ) As can be seen in Tables 5.2, 5.3, and 
5.4) the outstanding characteristic in every area is the large number 
of children expecting I's who get 2's, far outbalancing the number who 
get I's but e3<pect C's. Tliis kind of over-optimism is the principal 
source of disagretinent between e;q)ectations and first narks. 



Table 5.2 



Reading Mark, Tine 1 
12 3 Total Percent 





1 


30 


55 


10 


95 


55.2 


; Reading 


2 


23 


21 


2 


k6 


26.7 


^. Esqpec tat Ions, 














'M Time 1 


3 


3 


13 


if 


20 


11.6 




If 


3 


6 


2 


11 


6.1* 


.4 


Total 


59 


95 


18 


172 






Percent 


31*. 3 


55.2 


10.5 




100.0 



Table 5.3 



Arithmetic Mark, Time 1 
1 2 ' 3 Total Percent 





1 


17 


53 


5 


75 


1+3.6 


Arithmetic 


2 


llf 


kl 


9 


6k 


37.2 


Expectations, 














Time 1 


3 


if 


18 


3 


25 


11*. 5 




if 




5 


3 


8 


h.7 




Total 


35 


117 


20 


172 






Percent 


20.3 


68.0 


11.6 




100.0 
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Table 



1 

Conduct 2 
E^ectations, 

Time 1 3 

k 

Total 
Percent 



Conduct Mark) Tine 1 
12 3^ Total Percent 



33 


35 


11 


1 


80 


U6.5 


11+ 


21 


9 




Uk 


25.6 


8 


19 


6 




33 


19.2 


2 


10 


2 


1 


15 


8.7 


57 


85 


28 


2 


172 




33.1 


k9.k 


16.3 


1.2 




100.0 



Discrepances Between Expectaticns and Report Cards 
at End of First Grade 



By the end of the first grade children show some inprovement in 
their ability to forecast narks accurately. For reading ^3% of 
children (z » 2.82, p < .01) are correct, and for arithmetic k6jo 
(z s 1.6^, p<.10) are correct, the latter result not attaining 
conventional significance levels. The picture for conduct resembles 
that for arithmetic {kSi, z « 1.71, P < .10) . (At midyear the children 
had accurately forecast their mark in 32^, 3^, and 3^ of the cases 
for reading, arithmetic, and conduct respectively.) 

For cohort 1 alone, no significant matching was observed while 
in cohort 2 only reading showed matching significantly greater than 
chance between the children's year end expectations and marks (t » 2.89, 
p < .01). 

The discrepance between year- end marks and extpectations will be 
treated in more detail later in this chapter in relation to other 
variables. 



How Do Children's Expectations Change 
Over the First-Grade Year? 



It has already been pointed out that the average level of children's 
e3q)ectations in all three areas increases over the first-grade year. 
I!ow does this change appear at an individual level? Muah of what follows 
aidresses this question by looking at changes in expectations in 
relation to marks received, to the e^ec tat ion-mark discrepance at 
mid-year, to parent-expectations, and the like. !Ihe slaplest question 
is to ask how much matching at an Individual level there is at midyear 
and at the end of the year, and this question will be answered first. 

About kSff, of children hold the same expectations for reading at 
the end of the year as in the middle of the year, as Table 3»5 shows. 
While UO^ would be expected to show the same expectation by chance., 
the amount of persistence observed is too large to attribute to chance 
(z s 2.18, p < .0^). More important, those whose expectations change 
tend to move up--60^ move up while move down—although the trend 
does not quite attain significance (X^ = 3.18). 

In arithmetic the picture of persistence is more pronounced— 
kSffi have the same expectation at midyear and at the end of first grade. 
Because of the differing margins in arithmetic conqpared to reading, 
however, this rontchinH (37^) exceeds chance expectancy by a larger 
amount, and in fact is highly significant (z » 3*62, p < .01). There 
is no significant tendency for expectations in arithmetic to rise or 
fall over the year, although ^0^ of those who do change move up. 
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Table 5.5 



Reading Expectation, Time 2 



Reading 
Expectation, 
Time 1 





1 


2 




4 


Total 




1 


57 


33 


k 




9i* 


54.0 


2 


23 


23 






46 


26.4 


3 


k 


15 


2 




21 


12.1 


. k 


8 


3 


1 


1 


13 


7.5 


Total 


92 


7h 


7 


1 


17k 




Percent 


52.9 


1+2.5 


1* 0 


0.6 




100.0 



Table 5.6 
Arithmetic Escpectatlon, Time 2 







1 


2 


3 


4 


Total 


Percent 




1 


46 


25 


3 


2 


76 


43.7 


Arithmetic 


2 


21 


34 


7 


1 


63 


36.2 


Expectation] 












Time 1 


3 


11 


11 


5 


1 


28 


16.1 




k 


3 


1 


2 


1 


7 


4.0 




Total 


81 


71 


17 


5 


174 






Percent 


46.6 


40.8 


9.8 


2.9 




100.0 
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Table 5.7 



Conduct Expectation, Time 2 
1 2 3 1» Total Percent 





1 


ka 


25 


6 


79 


k3,h 


Conduct 


2 


27 


12 


5 


UU 


25.3 


Expectation, 














Time, 1 


3 


13 


lU 


8 


35 


20.1 




U 


7 


6 


2 


1 16 


9.2 




Total 


95 


57 


21 


1 17^* 






Percent 


5»*.6 


32.8 


12.1 


0.6 


100.0 
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Rather surprisingly there is no more correspondence between 
children's expectations in conduct at the two time points than would 
be predicted by chance— Uo^ give the same guess on the two occasions. 
Approximately 36^ would be e^^ected to give the same guess by chance. 
There is, however, a highly significant upward movement of expectations 
for conduct over the year. About 66^ of those children whose 
expectations change show an increase over the year, a movement that 
is highly significant (x\ = 10.37, P < .01). A large increase in 
esepectations for conduct marks occurs— about half a grade point- 
as we have already seen, and this ii^provement is manifest over the 
entire marking range. 

To sum up: Children's expectations in reading and arithmetic 
are quite stable over the year and tend to change little. In both 
cases when there is a change, it tends to be in an upward direction 
but only the trend for arithmetic is significant. For conduct there 
is noticeable and significant change overall and it represents a 
highly significeuit upward movement. 

How Do Expectations Change in Relation to 
Initial Expectation Level 



As Tables 3,3, 3,6, and 3*7 show and as the x -tests for direction 
of change also document, there is a much stronger tendency for 
e^cpectations to m./e up than down. For children with low expectations 
at midyear (3's or 4's), there is a marked movement upward. For those 
with es^ectations of 2 and 3, where movement could occur either up or 
down , there is no downward movement for reading, and very modest 
amounts for arithmetic (9 cases out of ^1) and conduct {3 cases out of 
39) » Thus using only data not subject to floor or ceiling effects, 
the upward thrust appears strong. 



How Marks Compare in Different Areas 



First Report Card 

^4arks are assigned in all three areas on a range from 1 (high) to 
h. The teachers' marking range is restricted, however. Of 178 cases, 
no U's are given in either arithmetic or in reading at the middle of first 
grade, and only one U (in arithmetic) is given at the end. As remarked 
ea: lier, average marks are relatively stable over the year in reading 
and in conduct, but there is a significant increase over the year in 
the average mark in arithmetic. 

Marks In this school are given in relation to the student's 
supposed ability rather than in relation to others in the class or to 
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Table 5.8 



Arithmetic Mark, Time 1 
2 3 Total Percent 



Reading 



1 


22 


38 


1 


61 


3h.3 


2 


13 


72 


12 


97 


5h.5 


1 












3 


1 


10 


9 


20 


11.2 


Total 


36 


120 


22 


178 




Percent 


20.2 


67.k 


12.1* 




100.0 



Table 5.9 



Arithmetic Mark, Time 2 





1 


2 


1 




27 


2 

Reading 


23 


59 


Mark, Time 2 ^ 


2 


U 


k 






Total 


66 


97 



Percent 37.8 53.9 



I 
5 

7 
I 
lU 
7.8 



1 

0.6 



Total 



Percent 



71 


39.^ 


88 


48.9 


20 


U.l 


1 


0.6 


180 






100.0 




some absolute standard of achievement. Issues related to the marking 
practices will be taken up in the Discussion but it is worth noting 
here that with almost 90% of marks in both reading and arithmetic in 
the highest two categories as of the first report card, there is 
little room left for upward movement. 

There is greater consistency between marks in these two subjects 
than would be e:?pected, for 99^t of the marks in reading and arithmetic 
are within one unit of each other. As shown in Table 5.8, significeuitly 
niore students (57.9^) than would be e^^ected by chance get exactly 
the same mark in both subjects (z = U.ll, p < .01). Of the k24l% who 
do not get the same mark, 28.1^ show a significant tendency (x* = 9.99, 
p < .01) for the arithmetic mark to be one unit lower than the^ reading 
mark, rather than one unit higher (12.9^^). (Elsewhere the tendency 
of teachers to "mark harder" in arithmetic will be noted.) 

The present analysis makes clear that the variance of teachers' 
marks within children is less than would be escpected, even allowing 
for the restricted range. Given the margins, 12 children would be 
esgpected to have marks separated by 2 units; only 2 such children 
are observed. !nie numbers are too small, of course, to lead to firm 
conclusions, but the data are in a direction consistent with a "halo" 
effect. Teachers are inclined to rate the same child similarly in the 
different areas. 

There is significant matching of marks above chance level between 
conduct and both reading and arithmetic (^8.9^, z s 2.1(9, p < .01 for 
reading; 5^.5%, z 3.89, p < .01 for arithmetic). In both cases 
there are about as many children as would be predicted whose conduct 
and substantive marks are 2 units apart. There is no significant 
trend for marks In either reading or arithmetic to be higher or lower 
them marks in conduct. 



Marks at End of First Grade 

The exact matching of marks in reading and arithmetic increase 
about 3% from the middle to the end of first grade. Over three- fifths 
(60.6^) of children (N=l80) get the same mark in both subjects at the 
end of first grade (Table 5.9). This correspondence Is highly sig- 
nificant (z = 5.29, p < .01). Tte tendency for marks in reading to 
exceed those in arithmetic that existed earlier disappears by the end 
of first grade. Teachers display about the same marginal distributions 
of marks for both subjects (about hO^ A's and 50^ B's in both) at the 
end of the year, whereas at midyear arithmetic was marked more stringently 
i20jl, A's in arithmetic vs. 3^% In reading). 

At the end of fi>'gt grade there is still significant matching 
between marks at midyear both in conduct and In the two substantive 
areas (i+6.7^, z = 2.02, p < .05 in reading; kS.3i, z = 2.U, p < .05 . 
In arithmetic) . Percentages matching decrease in both areas, however, 
over the course of the year. These decreases are not significant, but 

' /: ». 
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deserve attention because they are opposite In direction to the 
original observi.blon of agreement between the two substantive areas. 
There Is no distinct trend for marks to be higher or lower In conduct 
than In the substantive areas. There Is also less clustering around 
the main diagonal than In the reading or arithmetic comparison, with 
11 and 6 children whose reading and arlthnstlc marks, respectively, 
are two or more units distant from their conduct mark. This corresponds 
closely to the midyear data where 9 and 10 children respectively had 
conduct marks 2 or more units distant from their substantive area 
marks. Thlj number of children In small, but together with the 
reduction of exact matching. It probably signifies that teachers 
make more "fine" distinctions between academic performance and class- 
rocaa social behavior as the year progresses. 



How Expectations Conqpare in Different Areas 



At First Grade, Midyear 

Although marks (and thus teachers) signify that performance levels 
in reading and arithmetic are correlated, children do not anticipate 
this association. Children who think they will do well in reading at 
the first report card are apt not to be those who think they will do 
well In arithmetic. 

Guesses before the first report card match in the two areas in 
only 23% of the cases and by chance 36^ matches ./ould be predicted 
(Table ^.10). This imder-matching between reading and arithmetic is 
highly significant (z=3.29> P< .01). Children whose expectations 
in reading eind arithmetic do not match expect to do worse in arithmetic 
than in reading— expect to do worse vs. 31^ expecting to do better— 
a difference significant at about the 3% level (X^ - 3.8l) when only 
unmatched cases are tested. 



A End of First Grade 

At the end of first grade significant undermatching is still present 
(z = 2.60, p < .01), although more children than earlier (33^ vs. 23%) 
escpect the same mark in the two subjects. Chance prediction is k2% 
(see Table 3>11)> In fact the discrepance from chance is roughly of 
the same magnitude as the discrepance at midyear. More guesses still 
place arithmetic below reading but, unlike results for midyear, the 
tendency is not strong enough at the end of the year to attain significance. 

Tn view of the ubove, conduct marks could not agree well with 
both arithmetic (uid reudinK. A' jidyear there is a slgnificai t excess 
of matches between reading and conduct {hk% match compared to ik% 
predicted, z - 3. 73, p < .01) and a significant deficit in matches 
between arithmetic and conduct {23rp match compared to 33^ predicted. 
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Table 3.10 



Arithmetic Expectations, Time 1 





1 


2 


3 


h 


Total 


Percei 


1 


ho 




10 


3 


102 


51*. 8 


2 

Reading 


27 


6 


Ik 


1 


U8 


25.8 


Esqpectations, 










23 


12.4 


Time 1 ^ 


9 


9 


1 


k 


k 


6 


3 


i» 




13 


7.0 


Total 


82 


67 


29 


8 


186 




Percent 


1+U.l 


36.0 


15.6 


l*.3 




100.0 



Table 5.11 

Arithmetic Expectations, Time 2 





1 


2 


3 


4 


Total 


Percei 


1 


38 


47 


8 


2 


95 


51.6 


2 

Reading 


45 


21 


12 


2 


80 


43.5 


Expectations, 










8 


4.3 


Time 2 


3 


3 


1 


1 


4 


1 








1 


0.5 


Total 


87 


71 


21 


5 


184 




Percent 


47,3 


38.6 


11.4 


2.7 




100.0 
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2 = 2.3*t| P < .05). Children show a significant tendency (x* « 5.95, 
p < .01) to believe they will do worse in conduct than in re&dlng, for 
62^ of the children with unmatched expectations guess a lower mark in 
conduct. In arithmetic about as many children believe they will do 
worse in conduct than in arithmetic as believe they will do better 
(5h% vs. lte?&). 

At the end of first grade, the significant matching between 
guesses for reading and for conduct persists (50^ of the guesses match 
coiqpared to kyf, expected by chance, z = 2.03, P < .05). Although the 
absolute amount of matching has increased over the semester (from 
kk% to 50^) , the margins change in a way to increase the number 
expected by chance (from 35^ to kSlo). The "excess" matching at 
midyear is 9^ and at the end of first grade is 7^. In absolute terms 
then, there is a decrease in over-matching during the time period. 
There are roughly equal numbers who guess they will do better or 
worse in conduct compared to reading {h% vs. 52^ respectively). 

By the end of first grade there are about as many matched 
guesses between arithmetic and conduct as would be escpected by chance 
(33^^ are noted and hO^ would be expected) . Also guesses above and 
below are roughly equivalent — of the unmatched cases 53^ expect to 
do better in conduct and hTi> expect to do better in reading. 

To sum up: These children appear to start school with very 
high expectations overall, but particularly in reading. At first 
report card 55^ expect A's, and although by the end of the year only 
52^ expect A's, less than % expect less than a B. These children 
are clearly optimistic about the likelihood they will do well in 
reading and a year's esqperience in school does nothing but increase 
their optimism. Hopes in arithmetic are not quite as high, although 
almost half the children e^gpect an A on the first report card. The 
most noteworthy difference is that a significant fraction— about one- 
seventh — look for a C-tr D in arithmetic at year's end. 



What is Iii3>act of First Mark on Expectations Later? 



If a child got an A in reading, in 72^ of the cases he expects 
an A at the end of the year. If he got a B, in 5H of the cases he 
looks for a B again but in almost the entire remainder of the cases, 
he looks for an A (Ul^). If he got a C, In only 10^ of the cases 
does he look for a C again; in h% of the cases he looks for a B and 
in h% of the cases he looks for an A.' 

Can the improvement in forecast accuracy over the year (see pre- 
ceding section showing more matching between marks and expectations at 
end-of-year Uwn at midyear) be accounted for in terms of improved 
accuracy of only those who receive the higher marks? Ihat is, given 
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Table 5^.12 



Reading 
Mark, Time 1 



1 

2 
3 



Total 
Percent 



Reading Mark, Time 2 
12 3 1^ 



k2 
2k 
1 



16 

55 
Ik 



Ik 
k 



67 85 18 1 
39.2 1*9.7 10.5 0.6 



Total Percent 



58 

93 
20 

171 



33.9 
54.U 

U.7 



100.0 



Arithmetic 
Mark, Time 1 



Table 5.13 

Arithmetic Mark, Time 2 
12 3 1* 

29 k 1 
33 78 3 1 
3 10 9 



Total 65 92 13 



Total Percent 



31* 
115 
22 

171 



19.9 
67.3 
12.9 



Percent 



38.0 53.8 7.6 0.6 



100.0 
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the relative stability of marks between midyear and year end (68^ of 
the children receive Identical reading marks both times; 99^ receive 
reading marks less than 2 points away from their midyear mark) It would 
seem that following anything other than a rule such as "if you got a 
C this time (midyear) expect a C next time (year end)" would lead to 
Inaccuracy. Since accuracy improves, it mlg^t seem that children with 
C's who expect A's and B's are not the ones providing for any iioprove- 
ment. 

If we look at Table 3*12 however, we see that accuracy can be 
improved by children who get C's. We see that 70^ of the children 
who received a C a mid year in reading get a B at year end. Their 
marks actually improved which means expecting an improvement should 
increase accuracy for them. It would be Incorrect to conclude that 
only the children with A's and B's are producing the improvement in 
grade expectation matching despite the seeming contradiction that 
children with initially low marks maintain relatively nigh expectations. 

One should note that the children who receive C's at midyear are 
atypical in that the mark they should e:Q>ect at year end is. not the 
mark they received at midyear, as is the case with children who 
received A's and B's. The relatively small number of children that 
receive C's means this distinction can go unnoticed if one simply 
uses "percent receiving identical marks" in the whole table, as above. 

Of the children who received C's at year end, jQf) had received 
a B at midyear. The probability of getting a C after having a B at 
midyear is smaller than the probability of getting a C after a C at 
midyear (.13 as compared to .20) but the larger number of cases to 
which the former applies results in a larger proportion of the year 
end C's having a midyear 6 (lU cases) rather than a midyear C (U cases). 
In reading only 22% of the children who received a C at midyear 
received r C at year end. More is said about this differential 
accuracy in the next section. 

In arithmetic the seime tendencies appear but are less marked. 
If a child got an A (Nb3U), he gets an A again in Q% of the cases. 
If he got a Q, in h% of the cases he gets a lower mark; in 20^ of 
the cases he gets a higher mark. With a C mark again there is 
pronounced reason for expecting a higher mark for 67% receive a 
higher mark. 

The picture for conduct is highly similar— of those given C's 
(N=29), more than 30^ iiqprove. 

In reading, arithmetic, and conduct, of the children who receive 
C's at midyeai' the largest number receive B's at year end (70^, 
and 3^ respectively) . This means that children with the lowest 
grades should be optimistic if their forecasts are to be accurate. 
Downward recruitment from the B category is not as pronounced for 
arithmetic and conduct as for reading and this produeces a decrease 
in the number of persons who receive C's at the year end coapared 
to mid year. 
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How Does Children's Expectation-Mark Agreement Vary 
with the First Mark Received? 



In order to discuss tables like Table ^,lk, of which many other 
examples will appear in the remainder of this chapter and in the next 
chapter, we must first acquaint the reader with the possible patterns 
of outcomes, and the substantive significance of these patterns. First, 
of a student receives a mark of 1 his expectation can only be equal to 
or be lower than the mark. Only values 0, +1, +2, and +3 on the Mark- 
versus-E^pectation variable are possible for such a person. Similarly, 
with a mark of k expectations can only be equal to or higher than the 
mark. Only -3, -2, -1, and 0 are possible values of the Mark-versus- 
Expectation variable. For a mark of 2 only -1, 0, +1, and +2 are 
possible and for a mark 3 only -2, -1, 0, +1 are possible values. 

Second, one must identify the pattern of outcomes esepected if 
children were a) able to Judge accurately or test the reality of their 
performance and adjust their expectations accordingly, or b) acting 
independently— without any feedback or knowledge— of school performance. 

If children formulate their escpectations independently of actual 
performance (marks) the distribution of expectations would be the same 
for all mark categories. It would, furthermore, be the same as the 
overall expectation distribution. For example, with an overall 
expectation distribution of 1 = 55^6, 2 s 2^, 3 = 12^, and U = 656 
we would es^ect the following pattern. 



Mark 



Mark-Expectation Prediction, 
Independent of Performance 
(percentages along rows) 





-3 -2 


-1 


0 


+1 


+2 


+3 




1 






55 


27 


12 


6 


100 


2 




55 


27 


12 


6 




100 


3 


5? 


27 


12 


6 






100 


k 


55 27 


12 


6 








100 



If, on the other hand, children were adjusting their expectations 
according to their accurate perceptions of their actual performance 
the bulk of the cases would fall in the column headed "0" (i.e. no 
mark- expectation difference) for each of the mark categories, and most 
of the remaining cases would cluster in the -1 and +1 columns (i.e. 
small mark-expectation discrepancies for all mark categories). The 
actual amount of clustering cannot be predicted, of course, from the 
assumptions given above, so the percentages presented in the following 
table are hypothetical and made roughly consistent with the previous 
example for ease of comxsirison. 
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Mark-Expectation Prediction, 
Assuming Performance Monitoring 
(hypothetical percentages along rov) 



Mark 



2 



1 



3 
k 



-3 -2 -1 0 +1 +2 +3 

55 27 12 6 

19 55 20 6 

6 20 55 19 
6 12 27 55 



100 



100 



100 



100 



Under the "chance" or "marginal probability" assvinption the bulk 
of the cases fall on the diagone.1 that rises from the lower left comer 
of the table and a declining set of percentages equivalent to the 
marginal percentages is expected in the column headed "0" discrepance. 
Under the "reality testing" assumption the bulk of the cases fall in 
the coluEsn headed "0" discrepance. Note that the two assumptions give 
similar shaped distributions in the row headed. "Mark = 1." In any 
given case however, it may be possible to decide between the reason- 
ableness of the two assumptions by looking at the magnitude of the 
percent in the row » 1, column - 0 cell. If, for example, 95^ of cases 
appear in this cell the data are not inconsistent with "reality 
testing" assumption which only claimed the bulk of the cases would 
be found in this cell. Such a finding would be inconsistent with the 
"marginal probability" assumption which specified the magnitude of 
thi^ percent as approximating the overall percent of expected (i.e. in 
our exeunple 55^). 

With these two assurqptions and their resultant models in mind let 
U6 examine Table 5*1^ which presents the data for reading marks and 
expectations at midyear. The distribution of reading expectations at 
midyear was I's » 55^> 2's = 27^, 3*8 - 12^ and 4's » &fo. The reader 
will recognize these as the percents used in the "hypothetical" example 
above which can be used for a direct comparison. The data clearly tend 
to follow the "marginal probability" model. The bulk of the cases, for 
the 3 marks the teachers assign, fall on a diagonal decreasing to the 
left of the Mark = 1, Mark-Expectation - 0 cell. Furthermore, all these 
cells (51^> 53^» and 56^) approximate the 3^ expected on the basis of 
our overall percent of I's eicpected. The data do not perfectly fit the 
"marginal probability" model but approximate it more closely than the 
"reality testing" model (remembering the mark 1 distribution is 
consistent with both) . 

It appears, then, that reading performance (as Judged by teachers) 
is not information that children incorporate into their midyecu* reading 
expectation. This appears to be specifically so for those children who 
receive 2's and 3's,and possibly so for those who receive I's although 
here it is bard for the model to be wrong in the case of I's. 
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Table 



. Expectation-Mark, Reading, Time 1 
-3 -2 .-1 0 +1 +2 +3 Total Percent 



30 23 3 3 59 

2 585t 22^ llf^ ei, lOO^t 

Reading 55 21 13 6 95 

Mark, 

Time 1 o 56^6 11^6 22^ 11^ 100^^ 

10 2 U 2 Id 



3 
U 

Total 10 57 55 38 9 3. 172 
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By noting that those children with low marks have larger pro- 
portions of extreme and moderate discrepancies between their expectations 
and marks, one might take Table to show that children with the 
poorest marks are also the children who are poorest at forecasting 
their marks. The discussion of the "marginal probability" model which 
fit the data, however, suggests that all the children may be using 
an equivalent forecasting scheme which is independent of performance * 
Children who do better may be better at forecasting than children 
who do poorly but at this stage of the analysis it seems prudent 
to explain the finding on the basis of the simplest model which fits 
the data. 

Arithmetic 

For arithmetic the overall distribution of expectations is: I's = Uk%, 
2's s 37%t 3's = lU^, and U's B ^. As with reading the bulk of the 
cases seem to fall along a diagonal defending to the left of the mark ~ 1, 
discrepance > 0 cell, again supporting the "marginal probability" model. 
The majority of the cases clearly do not fall in the "0" discrepance 
column (which approximate the marginal distribution as the marginal 
probability would predict), but the percentages in the diagonal do not 
fit perfectly either. Most notably, the distribution in the mark s 3 
row show the bulk of the cases occurring between the positions our 
competing models would predict. This may indicate that some low per- 
formance children are accurately Judging their performance in arithmetic 
and lowering their expectations slightly (but not sufficiently to remove 
discrepance) , or it simply may be an artifact due to the small number 
of cases (20) involved. (See Table 3.13) 

In arithmetic, then, most of the evidence is consistent with an 
observation that children are not using their performance in arithmetic 
(as it is viewed by the teachers) in formulating their expectations. 

Conduct 

For conduct the overall distribution of expectations is: I's = kOf^, 
2's - 26^, 3's = 19^, and U's = 9^. The pattern is rather similar to 
that for arithmetic. The bulk of the cases fall on the diagonal (not 
the "0" column), supporting the non- reality testing model, but the 
diagonal percentages (^8^, Ul^, 39^) do not reproduce as closely as one 
might wish the expected percentage (U6^) . In this case it is the over- 
representation In the mark = 1, discrepcuice = 0 cell that seems aberrant. 
As mentioned earlier, if the percentage In this cell significantly 
exceeds the percent expected (the overall percent receiving I's), 
evidence Is thereby consistent with a reality testing model. Of the 
five cells on the diagonal or in the "0" column (neglecting the Uth 
row due to insufficient cases) four are consistent with the non- reality 
testing model (Ul^ vs. UO^; 2% vs. 26^; 39?^ vs. 46^; and 21^ vs. 1%^ 
and one may be more consistent with a reality testing model {^3$ vs. 
U6^ expected by non- reality testing), (see Table ^.l6) 
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Table 5*15 



E:q)ectat ion-Mark, Arithmetic, Tine 1 



-3 


-2 


-1 


0 


+1 




+3 Total Percent 


I 






^9?( 




11^ 


100^ 








17 


Ik 




35 








35^^ 








Arithmetic ^ 




53 


kl 


18 


5 


117 


Mark, 














Time 1 ^ 


2% 


l^5?t 


1% 


15?t 




100?^ 




5 


9 


3 


3 




20 


k 














Total 


5 


62 


61 


35 


9 


172 



Conduct 
Mark, 
Time 1 



3 

h 

Total 



Table 5.l6 
£:Q>ectation-Mark, Conduct, Time 1 



-3 


-2 


-1 


0 


+1 


+2 


+3 


Total 


Percent 








585t 
33 


2% 
Ik 


8 


2 


57 


lOO^t 






kli 

35 


2% 
21 


22i 
19 


12^ 
10 




85 


lOOf, 




39?^ 
11 


32^ 
9 


21% 
6 


n 

2 






28 




1 






50^ 
1 








2 


100^ 


1 


11 




61 


35 


18 


2 


172 
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In smmary . most of the data on reading, arithmetic, euid conduct 
support the statement that children of all ability levels nay formulate 
their expectations the same way before the first report card is 
received, in a way independent of performance as viewed by teachers * 



How Does Children's Expectation-Mark Agreement Vary with 
the Mark Received at the End of First Grade? 



By the end of first grade there is a noticeable movement toward a 
pattern of expectation-mark discrepancies consistent with the reality- 
testing model for reading for the students who get high marks. There 
is both the predicted piling up in the Row ~ 1, Column » 0 cell, and a 
marked shift in the Row - 2 distribution. For the Row s 3 children, 
results are mixed. It appears that optimisAi has moderated, but guesses 
about performance are still biased strongly in a positive direction. 
See Table ^.17. 

For arithmetic (Table 3*l8) there appears again a movement toward 
a pattern consistent with a reality-testing model. There is a slight 
piling \xp in the "0" column— an excess for Row « 1 and a mode for 
Row = 2. Again the children in the in the Row = 3 category seem aberrant 
and inclined to maintain the pattern manifest at midyear but the nuober 
of children (13) in this row is small so strong conclusions are not 
warranted. 

In conduct, unlike reading and arithmetic, there is the potential 
for a poor grade. Five children (2^) actually receive the lowest grade 
and 1% get C's or D's, a total of 1?^, whereas 11^ in reading and % 
in arithmetic get the lower two grades. 

Despite the fact that conduct should be more capable of monitoring 
by the child, it only shows a slight tendency toward reality-testing 
(see Table 3.19). The mark » 1, discrepance = 0 cell contains more 
cases (63^) than would be expected on the basis of the "marginal prob- 
ability" model (32^) and the mark » 3 row has its mode shifted one 
column away from when the marginal probability model would predict it 
to be (i.e. on the diagonal). Children who receive a mark of 2, however, 
exhibit almost exactly the pattern the marginal probability model would 
predict (50^t vs. 32%, vs. 35^6, 1% vs. 12^, and 1% vs. 1^). 

In sum: There seems initially, before the first report, a consider- 
able push for expectations to be high and for expectat i ons to be similar 
regardless of the child's actual level of performance a •. midyear . By the 
end of the second semester a noticeable though not extr-dme shift toward 
reality-testing has occurred for reading. Arithmetic und conduct, 
however, show only very slight departures from the "nazgJnal probability" 
model* These departures only minimally exceed those observed for 
arithmetic and conduct at midyear. At year end expc-Jtations remain 
largely independent of actual performance in arithmetic and conduct 
while a slight dependence Is exhibited in reading . 
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Table 5.17 



Eigpectation-Mark, Reading, Time 2 



-3 


-2 


-1 


0 


+1 




+3 Total 


Percent 


1 






6856 


29?6 


1^6 


1^^ 


100^6 








U7 


20 


1 


1 69 




2 




U3^ 


51^6 


6?6 








Beading 




37 




5 




86 


Mark, 




ei% 












Time 2 ^ 


32^ 


5?6 








100^6 




6 


12 


1 






19 




k 


100^ 












lOOff, 




1 










1 




Total 


7 




92 


25 


1 


1 175 





Table 5.l8 



Eo^ectation-Mark, Arithmetic, Time 2 



-3 


-2 


-1 


0 


+1 


•4-2 


+3 Total 


Percent 


1 






57^6 


38?6 


5^6 




100?6 








37 


25 


3 


65 










U3^6 


lk% 


3^ 




100^6 


Arithmetic ^ 




39 


Ul 


13 


3 


96 


Mark, 














Time 2 ^ 


58?& 


31^6 


23% 








100^6 




5 


U 


3 


1 




13 




U 


100^6 












100^ 




1 










1 




Total 


6 


^♦3 


81 


39 


6 


175 
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Table ^.19 



Esgsectatlon-Mark, Conduct, Time 2 
-3 -2 -1 0 +1 +2 +3 Total Percent 



^ IS k 63 

2 50^ 36^ 13^t 1^6 100?t 

Conduct U3 31 11 1 86 
Mark, 

Time 2 ^ 29?6 ^^3^ 2k<f, % 100?& 

6951 21 

u ^ ^ 20ia 100^ 

H 2 2 1 5 ■ 

Total 2 8 53 76 31 5 175 
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All children seem very optimistic before they get a first report 
card and optimistic in much the same way. Children whose grades are 
already at or near the ceiling cannot be more optimistic than their 
performance and this leads to a confounding of effects that must be 
acknowledged in intex^reting the findings. 



How Does Children's E^ec tat ion-Mark Agreement 
Vary with Expectation Level? 



Reading 

As already noted there is a marked optimism seen in children's 
initial expectations in reading for 5% look for A's on the first 
report card (3^^ receive one) and 82^ look for an A or a B. This overall 
optimism, however, disguises some interesting pessimism when expectation- 
mark agreement is broken down by expectation level (see Table ^.20). 
(The reader should note that by examining diagonals in this table, one 
can locate persons who received a particular mark. Thus reading down 
the right-most marginal, 30, 23, 3, 3» one has a summary of those 
children, 39t who received an "A" in reading on the first report card.) 

The majority of children actually receiving an A expected one 
(30/39 or 31%) but many more children expecting A's did not receive 
one (65/95 or 68?^). The majority of extraordinarily high hopes were 
therefore not confirmed. On the other hand, half the children expecting 
a "B" (23/U6 or 50^) actually received an "A," so children expecting 
B's are generally under- rather than over-optimistic. The under- 
optimism is even more noticeable among those children who expect C's 
or D's— only 13^6 of the total (V31) actually receive a C and none 
receives a D. Altogether 32^ of children forecast their mark exactly, 
and most of these get an A. The general trend seems therefore to be 
one of inaccuracy— the overall mean suggests an inaccuracy based on 
over-optimism but further analysis suggests both over- and under-optimism. 

The presence of both over- and under-optimism, while worth noting 
because of its substantive inqpact, is not entirely unexpected. If one 
disregards exact matching of marks and expectations, it is easy to see 
that children expecting a k can only underestimate. Likewise a child 
expecting a 1 can only overestimate (or "exactly" forecast his mark) . 
Those expecting 2's and 3's can either over- or underestimate but these 
categories can lead to extreme discrepancies occurring in only a single 
direction. Therefore, given some children expecting 3's and U's, the 
presence of some underestimation is guaranteed. 

One can ask whether children with naturally high expectations are 
superior (or inferior) at providing accurate forecasts. If children of 
all expectation levels showed no accuracy at all in forecasting their 
marks each expectation level would be assigned a mark distribution 
corresponding to the overall mark distribution (i.e. I's = 3U^, 2's = 55^, 
and 3's » 11^). This leads to the table of expected values shown below. 
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Expected Values for Coiqplete Lack of 
Forecast Accuracy 



1 



-3 



Expectation-Mark, Reading, Tl 
-2 -1 0 +1 +2 
11 55 3^ 



100?^ 
lOO^t 

lOQi 



Child's Reading 
Expectation, Tl 



2 



3 
k 



11 55 3^* 

11 55 3h 
11 55 



3h 



(The reader should note the parallel between this analysis and that done 
previously for children who yecei^-ed various marks.) A conparison of 
this table with Table 5*20 indicates a reasonable similarity, indicating 
lack of a superior forecast ability on the part of children at any of 
the expectation levels. Indeed, the most noticeable discrepance in 
terms of difference between percentages and the case base for the 
percentages occurs for the children who e^^ect a 2 and actually get a 
1. This cell is over • represented indicating these children are doing 
worse at forecasting than if their forecast bore no relation to their 
mark.' As a general statement, however, forecast ability does not seem 
to consistently vary with expectation level and the accuracy of fore* 
casting seems to be at about chance levels. While the above discussion 
is couched in terms of "forecasting ability" the reader should note it 
could Just as reasonably have been discussed in terms of the causal 
efficacy of e:q>ectations. If expectations hi^ly influenced marks, or 
even if different expectation levels had different effects on marks, 
deviation from the "chance" model would have been expected. That is, 
the mark distributions within each e:cpectation level should have deviated 
from the marginal distribution. In particular the column headed "zero 
discrepance" and those with small discrepancies (-fl or -1) should have 
been over- represented if expectations were displaying substantial causal 
efficacy. 

Arithmetic 

In arithmetic optimism is considerably less— only kk% look for an 
A (and of these only one in four receive an A). Overall, however, the 
high percentage looking for an A or a B (82^) resembles the picture 
for reading (82^). Of those looking for a B in arithmetic, however, 
the vast majority {kl/6h or 6U^) receive one, a situation which differs 
from that noted in reading where most received A's. Again one notes a 
degree of over-pessimism because of those looking for a C or a D— 19^( — 
only a few persons, 3 out of 33, receive even as low a mark as a C. The 
data are shown in Table 5*21. 

As with reading .we can ask for arithmetic whether children of 
different expectation levels display differential forecasting ability. 
Since the "marginal" distribution of marks is 12^ - 3's, 68^ - 2's, and 
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Table 3*20 



Expectations-Mark, Reading, Time 1 
-3 -2 -1 0 +1 +2 +3 Total Percent 

, 11^6 583t 32% lOQfH 

10 55 30 95 

2 H hen, ^Qffi, 100% 

Reading 2 21 ?3 ^6 
Expectation, 

Time 1 , 20% 6% 1% lOO^J 

^ it 13 3 20 

|. l83t 55?£ 2% 100ff> 

2 6 3 11 

Total 10 57 55 38 9 3 172 



Table 5.21 

Expectation-Mark, Arithmetic, Time 1 

-3 -2 -1 0 +1 +2 +3 Total Percent 

1 % ni 23% 100% 

5 53 17 75 

P 1^ ek% 22% 100% 

Arithmetic S kl Ik 6U 
Expectation, 

Time 1 ^ 12^ 72^t l6^ 100^6 

^ 3 18 U 25 

u 3^ (>3% 100% 

* 3 5 8 

Total 5 62 61 35 9 172 
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Table 3.22 



1 



Expectation-Nfiirk, Conduct, Time 1 

-3 -2 -1 0 +1 +2 +3 Total Percent 

1% Ihi kUi kl% lOO^t 

1 11 35 33 80 



2 20% 32?t 

Conduct 9 21 lU kk 
Es^pectatlon, 

Time 1 , Idf, 3% 2k% 

^ 6 19 8 33 

k 7% 13?& 67^t 13^ 

1 2 10 2 15 

Total 1 U 61 35 18 2 172 
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20^ - 1*8, this ia the distribution we would esqpeot in the rows of the 
table if children of all expectation levels displayed "ehanoe" levels 
of forecasting ability. Table 3>21 very closely approximates this 
(noting the small case base for an es^eotation of U). The data do 
not contradict a statement that children of all ability levels forecast 
at the chance level or, equivalently, that children's expectations are 
not differentially influencing the mark distributions. 

Conduct 

In comparison with the previous tables for reading and arithmetic 
two factors stand out in Table 3>22 which summarizes data for conduct. 
The first is the reduction in optimism— 2d^ look for C's or D's. The 
second is the wide range .In discrepancies— almost 19^ of the guesses 
are off by 2 or more units in either direction. This lack of accuracy 
is paradoxical in a sense because conduct is the area in which the 
child presumably has the most clear-cut feedback and in which he has 
most control over performance. 

Again to answer ,the question of whether or not children of different 
expectation levels display differential forecasting ability we must 
start from the marginal mark distribution: in this case 1^ receive U's, 
16^ - 3's, h9i, - 2's, and 33$ -I's. Coinparing Table 5.22 with the 
"chance expectation" table arrived at by placing this distribution in 
the appropriate portions of the rows of a table, shows again that the 
chance model fits quite well. Thus children of all expectation levels 
(with the possible exception of those expecting I's) are forecasting 
their marks at near chance levels. 

In summary , in all three areas both over and under optimism is 
present— the over optimism naturally occurs to children with high 
expectations and under optimism occurs to those with low expectations. 
In all three areas children of all expectation levels show only a 
chance level of forecasting their marks or, equivalently, in all three 
areas the children's differential expectations are not differentially 
influencing to mark distributions at midyear (i.e. the midyecur mark 
distribution is the same no matter what level of children's expectations 
is considered). THiia lack of a midyear effect contrasts with that 
noted in the next section concerning both midyear and year end observations. 



The Effects of Feedback 



Reading; Feedback Effect on Marks 

About 23^ cf the rr^rks in reading went up between the middle cmd 
end of the first grade, 3% remained stable, and ISffo declined (based on 
a sample of l65 children) . With one exception all movement was 
confined to a single step (i.e., from B to A, C to B, etc.). 
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Table 5.23 



Child Previously 
Did B,S,W Than 
He Expected (Tl) 



Change in Reading Mark, Tl to T2 





Up 


Same 


Down 




Better 


k 


32 


12 


kB 


Same 


10 


32 


9 


51 


Worse 


2k 


32 


10 


66 




38 


96 


31 


165 



Table 5.2U 
Predicted Reading Mark Change, Tl to T2 



Up Same Down 



Child Previously 
Did B,S,W Than 
He Expected (Tl) 
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If a child's year end mark io^roved over his midyear mark (N » 38), 
in 6% of the case» his mark at the middle of the year had been Igss 
than he expected. If a child's marks deteriorated (N « 31) in 3S^ 
of cases his mark was better than he had nxpeeted. The 3X3 table, 
Table 5*23, is a collapsed version of a 7 X 7 table that relates all 
possible types of mark changes to all possible types of mark-e3g;>eetatl(»i 
relationships at midyear. A test on the four corners of the table 
shows the effect of expectations upon later ^qprovement (or deterio- 
ration) in marks to be highly significant (X^ s 9.18, p < .01) given 
an initial discrepance between mark and expectation. Hhere is a 
significant tendency for marks and expectations to move toward con- 
sistency. But a more interesting coiqparison is one that can be applied 
to the entire table and one which has a oausal frame of r<^ferenee. In 
terms of causation, expectations being higher than the mark should 
lead marks to rise; expecstatlons being the sabe as the mark should 
lead to no change in mark; expectations being lower than the mark should 
lead the mark to fall. In a perfect causal system in which expectations 
were the only causes, all cases would fall on the "minor diagonal" of 
the table above if consistent with the pattern Just outlined. 

With the margins as given the expected values are those given 
in Table The cells on the minor diagonal (enclosed in rectangle) 

sum to 33*89, and their sum departs from the observed data in the 
direction predicted by the causal hypothesis— 9 more children (2U-13) 
have marks that rise, 3 more children have marks that fall, and 2 more 
have marks that stay the same. The easiest way to assess the "minor 
diagonal" effect is by the same statistic used earlier to assess the 
major diagonal (match) effect. Such a test here reveals a significant 
effect (z s 2.U3, p < .01). The magnitude of this effect is hard to 
assess but in that column representing those whose marks improve one 
can see that most (63^) are recruited from students who did not do as 
well as they expected earlier. 



Reading; Feedback Effect on Expectations 

Marks are affected by an earlier mark- expectation discrepance but 
the inertia of marks seems relatively large— there is a strong tendency 
for marks to stay the same no matter what the discrepa:ice. Expectations, 
though, are more responsive in an optimistic direction, for of those 
{kj) whose mark was higher than expected, 38 (8l^) raise their hopes. 
Expectations at the end of the year are more responsive to earlier 
marks than are the marks at year end. Thic is hardly surprising since 
a child's expectations are something he himself controls, whereas his 
mark depends on someone else's behavior. 

If a child receives a mark higher than he expected in reading at 
the end of the first semester (as 28^ of 166 children do), then his 
response is very likely (chances are better than U:l) to increase his 
expectations at the end of the second semester. If he gets exactly 
what he expected at the end of the first semester (as 33^ do) then 
he will be likely to keep that e3q>ectation (chances are 3 out of 3); 

* J * * 
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T&ble 3.25 



Change In 
UP 

Better 38 

Child Previously 

Did B,S,W Than Same 12 
He Expected (Tl) 

Worse 0 
50 



Reading Expectations, Tl to T2 



Same 


Down 




9 


0 


If? 


35 


7 




36 


29 


65 


80 


36 


166 



•'"J D 
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but if he does change, he is about twice as likely to increase his hopes 
as to decrease them. If he gets a lower mark than he hoped for (as 
39^ do), then he will not increase his expectations at the end of the 
second semester but he is somewhat more likely to hold to his original 
expectation than to lower his sights (chances are 6 out of 11). 

This analysis based upon combined cohorts presents a more coniplete 
picture of the response of esgpectations to previous marks than is 
available in either cohort and documents with more certainty the trends 
seen there. 

The matter can be explored most easily by condensing a 7 X 7 
table of discrepancies vs. changes in expectations to a 3X3 table. 
Table 3*23* First, the degree of matching above chance is highly 
significant (z = 8.0^, p < .01) and amounts to 6l^. Thus if a child 
received a higher mark theui he expected, his hopes tend to rise. 
If he got what he expected, his hopes tend to remain stable. If he 
got less than he expected, his hopes tend to decrease. But, and this 
is perhaps the most linportant part of the table, if he got less than 
he hoped, his hopes are more likely to remain fixed than to drop 
(36 vs. 29 children). That is, there is some tendency ^or these 
children's expectations to decline, but this chance of decline is still 
less than even. If he got more than he hoped, on the other hand, his 
hopes respond vigorously (36 vs. 9 children) and the odds for an 
increase are over U:l. If he got exactly what he e^gpected his hopes 
may change in either direction with perhaps some bias in the positive 
direction. This receptivity to good news and relative resistence to 
bad news ceui be texmed a "bouyancy effect." 



Arithmetic; Feedback Effect on Marks 

About 27% of children's marks in arithmetic Increase from the 
middle to the end of the first grade year, and less than ^ decrease. 
(In reading 23^ increased and 19^ decreased.) The number of cases 
decreasing in arithmetic is too small to Justify analysis. 

For children whose marks increase, 31^ had received a mark less 
than they hoped for earlier (compared to 36^ who received what they 
expected and 13^ who received more). Of those whose marks iisprove, 
then, the majority had hoped for a higher mark than they received. 
As before, a Z- value based on the minor diagonal can be used to test 
the hypothesis that marks and expectations movs In a way to improve 
their consistency, and the test is borderline (z = 1.^2, p < .10, one- 
sided). The magnitude of the movement Is less for arithmetic than for 
reading. 



Arithmetic; Feedback Effect on Eagectations 

Again, as was true for reading, children's expectations are more 
labile than marks In terms of the earlier (Tl) expectation-mark dis- 
crepance—some children (over 10^) raise or lower their expectations by 
2 or 3 units. Marks have a much narrower range of variation. 




Table ^.26 



Change in Arithmetic Mark, Tl to T2 



Child Previously 
Did B>8«W Than 
He Ea^ected (Tl) 





«? 


Same 


Dovm 




Better 


6 


32 


k 


k2 


Same 


16 


kl 


2 


59 


Worse 


23 


39 


2 


6k 






112 


8 


165 



Table 5.27 



Change in Arithmetic Esgpectations, Tl to T2 

Up Same Down 

Better 30 11 1 k2 

Child Previously 

Did B,S,W Than Same 13 35 11 59 
He E^^ected (Tl) 

Worse 2 36 2? 65 

1»5 82 39 166 



5 - 35 



Ihe easiest way to see overall trends in esg^eetation change as a 
function of marks received, is in a 3 X 3 table, Table 3.27 (sunnarized 
from a 7 X 7 table). Again the amount of matching (3^) is large and 
highly significant (z = 6.08, p < .01) when conqpared to the amount 
expected by chance {3h%). Also as earlier was true for reading, the 
majority (3^) of children who received negative feedback (did worse 
than es^ected) at least maintain their expectations at the same level 
despite the receipt of a mark lower than they expected. Also, as was 
true earlier for reading, there is a vigorous response to positive 
feedback— 71^ respond with raised hopes. If the child previously got 
exactly what he expected, about the same number of children raise hopes 
as lower them (13 vs. 11). 

With positive feedback, then, the majority response is to raise 
hopes. With negative feedback, a considerable fraction lower hopes, 
but a larger fraction maintain hopes at the same level. In sum, a 
bouyancy effect is observed for arithmetic. 



Conduct; Feedback Effects on Marks 

In conduct there is a positive shift in marks given at the end of 
the year— 22^ of children get higher marks waile 11% decline. Again 
trends are clarified by a 3 X 3 summary of the midyear discrepance 
between expectations and marks and mark changes (Table 3.26). There is 
a significant over~ representation of cases on the minor diagonal 
(z s 2.37, p < .01). The amount of matching is Ul.8^, or about more 
than chance prediction. There are about equal numbers of children with 
positive and negative discrepances (earlier in reading and arithmetic 
there were more children with positive discrepances, i.e., e^ectations 
higher than marks). 

Since there are some children whose marks decline (11^), one can 
also exeunine the effect of receiving a uiark higher than one esgpected. If 
a child's mark was higher than he hoped in conduct, his mark is less 
likely to shift down (15^t shifted down) than to remain stable (72^^ remain 
stable) but the probability of his mark shifting down is slightly greater 
than chance would predict. This picture replicates what happened for 
arithmetic. 

The distribution of changes in conduct marks is almost identical 
for the neutral and positive feedback cases— 8, Uo, 7 compared to 7, 39, 
8. With neutral and positive feedback, marks show a slightly but not 
significantly higher- than-chance probability of staying the same (UO vs. 
37 expected, 39 vs. 36 expected, respectively). Only if the discrepance 
is one where a child did worse than he expected is there a noticeable 
effect later. In this case marks tend to increase significantly (X s 
12. OU, p < .01). Since 21 cases display an increase out of 56, the*effect 
is also large in practical terms. Conduct would be an area where a child 
could increase the quality of his performance more readily than in 
arithmetic, say, and it is therefore sensible for the effect in conduct 
to be larger. 
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Table 5*26 



Better 

Child Previously 

Did B«S,W Ulan Same 

He Expected (Tl) 

Worse 



Change in Conduct Mark, Tl to T2 



UP 


Same 


Down 




7 


39 


8 




8 


i»0 


7 


55 


21 


32 


3 


56 


36 


111 


18 


165 



Child Previously 
Did B,S,W Than 
He Expected (Tl) 



Table 5.29 
Change in Conduct Esqpec tat ions, Tl to T2 





up 


Sane 


Down 




Better 


kl 


5 


1 


53 


Same 


15 


33 


10 


58 


Worse 


3 


28 


2U 


55 




65 


66 


35 


166 
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Conduct! Feedback Effect on Expectations 



The eacpectatlon changes for conduct from the middle to the end of 
the first-grade year resemble the picture for similar change In reading 
and arithmetic: much more lability characterizes changes In expectations 
than in marks. Again a 3 X 3 matrix makes changes easier to see (Table 
5*29). Almost equal numbers of children expected more, the same, or 
less than the mark they received, but of those who received more than 
they ejected there Is a strong tendency (89^6) to subsequently alter 
expectations up, In the direction of consistency. If a child got 
exactly what he expected, there Is movement both up and down, with no 
particular trend in either direction. But if a child got less than he 
expected, as before with reading and arithmetic, his expectations are 
more likely to remain stable than to decrease {36ji, do not decline). 
His response to negative feedback in the direction of consistency is 
much smaller than his response to positive feedback. A buoyancy effect 
is observed here also. 

There is a highly significant amount of over-matching (z s 8.03, 
p < .01) and the absolute level— 63^-is Impressive. In other words, 
for conduct if a child earlier has received a mark he expected his 
expectations tend to remain stationary. If he earlier received more 
than he expected, his expectations tend to rise, and conversely if he 
received less than he expected, his hopes tend to fall, but as already 
remarked, the tendency to fall is much less than the tendency to rise. 



Summary of Feedback Effects 



Effects on Marks 

In all cases the majority of children received the same mark at the 
end of first grade as at midyear of first grade. That is, for reading, 
arithmetic, and conduct generally and for all the breakdowns of each of 
these into distributions of students who did better, the scune, or worse 
than they expected, the majority in any breakdown can exPect to get the 
same mark at year end as at midyear . 

Tliere is also, however, a consistent (and significant in the case of 
reading and conduct) pattern for cases on the minor diagonal to appear 
with greater than chance e:5>ectatlons. More specifically, there is a 
consistent observation of the following types of cases occurlng more often 
than would be expected by chance: 

a) A child's marks going up at year end and after the child did 
worse than he expected at midyear . 

b) A child's mark remaining the same at year end after the child got 
the same mark he expected at midyear . 

c) A child's mark going down at year end after the child did better 
than he expected at midyear. 
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These observations are consistent with the notion that student 
expectations causally affect the marks students receive, even if the 
degree of association is rather small. As noted earlier, the restriction 
in range on the marking scale and the similar margins at mid- and end-of- 
year mitigate against effects being displayed (The power of tests is low). 



Effects on Expectations 

In all three areas (reading, arithmetic, and conduct) there is a 
significant tendency for children's expectations to move toward being 
consistent with the mark they received at midyear. If the midyear mark 
was above the children's expectations, their expectations moved up 
substantially; if the midyear mark equalled the children's expectations, 
their expectations showed less change than would be expected by chance; 
and if the midyear mark was below the children's expectations, their 
expectations showed a greater than chance tendency to decline but in all 
three cases (reading, arithmetic, and conduct) this decline was not as 
pronounced as the previous two effects (the majority of children main- 
tained their expectations at the same level it had been at midyear). 

When information on marks is combined with that on expectations, 
it looks as though when a child gets less than he hoped he does not 
lower his sights— his expectations remain the same and there is a tendency 
to bring marks in line. If he got more than he hoped, his e:qpectations 
immediately rise and his mark stays the same. There is altogether what 
might be called a bouyancy effect— a tendency for marks or expectations 
to rise to narrow the discrepance between them. If the child does better 
than he expected his hopes rise; if he did worse than he expected his 
marks rise. If he gets good news he is receptive, whereas he seems 
relatively impervious to bad news. (The strongest trends observed in the 
data are stated without qualification in the above summary— i.e. , without 
regard for weak reverse trends— for siiqplicity of presentation.) 



Children's Ability to Forecast 



How does discrepance at midyear coiqpare with the end-of year discrepance 
in the mark- e3Q>ec tat ion relation? 



Reading 

Children get better at anticipating their marks, particularly at not 
underestimating . A summary of a larger table showing how expectations in 
reading relate to marks in reading (higher, same, or lower) at midyear and 
at the end of first grade appears in Table 5.30. The percentage agreeing 
Jumps up noticeably from 31^ to To test the significance of this 

increase In agreement, note that 36 cases remain the same, ik cases move 
away from agreement and 52 cases move toward agreement. The test for 
related samples «ives u \mlue of L?0.7'», significant far beyond the .01 
leve], indJcfilinK Uiul. i.lic 'A% Increase In uKreement Is higlily signii'lcant. 
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Table 5.30 

Mark-Expectation Discrepance for Reading, Tl vs. T2 



Child Es^ected to Do B,5«W Than He 
Did in Reading 

First Semester 



Second 
Semester 





Better 


Same 


Worse 




Better 


30 


8 


13 


51 


Same 


30 


36 


22 


88 


Worse 


5 


6 


11 


22 




65 


50 


k6 


161 



Table 5*31 

Mark-Expectation Discrepance for Arithmetic, Tl vs. T2 



Child Expected to Do B«S,W Than 
He Did in Arithmetic 

First Semester 



Second 
Semester 





Better 


Same 


Worse 




Better 


23 


8 


10 


kl 


Same 


29 


3k 


Ik 


77 


Worse 


11 


16 


16 


k3 




63 


58 


UO 


161 
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Ihe result is also significant in practical terms for the majority of 
children move toward agreement. 

The previous pessimists and the previous optimists are about as 
likely to be recruited into the "agree-ers" category— of those who 
used to have too-high hopes now have their e3q)ectations in line and 
of those who used to have too- low hopes are now in line. More previous 
optimists are recruited (30) than pessimists (22) simply because there 
were more optimists to be recruited from (63 compared to U6). On over- 
shooting, there is a slight tendency for those who had hopes too low to 
move further up than the reverse— 28^ of those with too-low hopes 
earlier vs. 8^ of those with too-high hopes earlier modify their 
esipectations or their marks to such an extent that the discrepance 
between marks and expectations has now reversed its direction. 

A tendency toward optimism exists at the end of the year, as the 
margins of Table 3 •SO show, and the tendency strengthens when the 
Increased matching is allowed for. That is, in terms of absolute 
numbers of children being overconfident, kOji> at midyear and 32^ at the 
end of first grade expect a mark higher than they get. But it is more 
pertinent to see how those not matching divide up at both times. Of 111 
not matching at midyear, ^9% have expectations too high. Of 73 not 
matching at the end of first grade, 70% have expectations too high. 



Arithmetic 

A similar analysis to that Just give for reading can also be given 
for arithmetic. The first step is again to condense a larger table to 
a 3 X 3 table (Table 5>31)* Again the amount of matching Improves, from 
36^ to kdH,, and a chi-square test (Xj - ^.83, P < .0^) shows a significant 
movement toward agreement: The movement toward agreement in arithmetic 
is not as strong as it was for reading. A test of the significcuice of 
differences in movement (24^ vs. 12^) falls Just short of the % of 
significance level (xj = 3.53). 

From another steuidpoint also the changes in agreement between 
expectations and marks at the two time points for reading and for 
arithmetic differ. At the end of first grade, kSfflt are matched but there 
is not the over- optimism in arithmetic that there is in reading. Of those 
disagreeing (N = 8U) about the same nuiober overestimate as underestimate, 
kS% vs. 51%' The tendency toward over-optimism is reduced rather than 
augmented. 

These findings must be viewed in the light of teachers' marginal 
distributions of marks in arithmetic. As noted elsewhere, teachers 
marked "easier" at the end of first grade than in the middle, so it may 
be proper to attribute the apparent equality in over- and under- optimism 
on the part of students to the shift in the teachers' marginal distribu- 
tions of marks. If the teachers had assigned approximately the same 
marginals in arithmetic at the two time points (as they in fact did in 
reading), then presumably more children would have received a lower mark 
than they expected, leading to over- optimism. 



Table 5.32 

Mark-Expectation Discrepance for Conduct, Tl vs. T2 



Second 
Semester 



Child EKPeeted to Do B.StW 
Than He Did in Conduct 

First Semester 





Better 


Same 


Worse 




Better 


26 


18 




58 


Same 


22 


23 


25 


70 


Worse 


7 


12 


Ik 


33 




55 


53 


53 


161 
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Conduct 



A similar kind of analysis for mark-expectation discrepance Is 
given for conduct In T&ble 3 •32. First, does agreement between 
expectations and marks Increase? Tlae agreement at midyear Is 33% and 
that at the end of first grade Is k3%' This Is the smallest Increase In 
agreement noted so far, and Is the first not to attain significance 
(Xj s 3.32, N.S.). There Is, hovrever, as was true for reading, relatively 
more over-optlmlsm at the end of the year than there was In the middle. 
Of those who do not manifest agreement (N s 91), are over- optimistic 
vs. 31% earlier. There Is some Improvement In teachers' marginals for 
conduct over the year {B3% received A's or B's at midyear conpared to 
65% at the end of the year) but the loqprovement Is slight. Therefore 
In conduct the main findings are (1) the failure for agreement to 
Increase slgnlficeuitly and (2) a relative Increase In over-optlmlsm 
with time (like that seen for reading). 

To s\iffl up: Improvements In children's ability to forecast are 
hard to assess because If teachers' marking preustlces differ from 
one report card to the next, the target of the forecasts Is shifting. 
For reading, where teachers are consistent, there Is a large Increase 
In agreement between marks and e:g;>ectatlons In both statistical and 
preustlcal terms. For those whose e^ectatlons and marks are not In 
line at year end, optimism far exceeds pessimism. For arithmetic, where 
teachers mark more easily at year's end, there is a movement toward 
agreement which is less strong than the movement in reading. Any over- 
optimism is probably masked by the teacher shifts. For conduct there 
is the smallest movement toward agreement coupled with a tendency toward 
over-optimism. Children do Improve in forecasting, then, but not equally 
across the three preformance areas. 



Parents' Expectations Compared to Children's Expectations 

In both cohorts of middle class children, parents are optimistic 
but guarded. As noted in Table 3.1, parents' expectations are fairly 
stable across areas, being I.89, 1.93* and 1.93 for reading, arithmetic, 
and conduct for the combined cohorts. A large majority (6^^) e^Qpect 
a "2" in reading, a "2" in conduct (64^), and 59?& expect a "2" in 
arithmetic*. 

Cohort 1 children exceeded those of cohort 2 in optimism— signifl- 
ceuritly so in arithmetic and conduct (t = 2.67* P < .01 and t = 3'19> 
p < .01 respectively), not significantly in reading (t = 1.26). Only 
cohort 1 Children are noticeably more optimistic in their expectations 
than parents (t = 3.78, p < .01; t « I.89, p < .05; t = 2.38, p < .01 
for reading, arithmetic and conduct respectively for cohort 1— all cohort 
2 t values are insignificant). Over 3% of the children in both cohorts 
taken together expect the highest grades in reading and and k% 
expect A's in arithmetic and conduct respectively. 

The matching between esgpectations of parents and children is about 
what would be predicted by chance, the observed percentages matching 
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Table 5.33 

Child's Reading Ebgpectation, Time 1 

123^ Total Percent 



Parent Reading 
E3Q)ectation, 
Time 1 



2 

3 
k 



17 



57 



11 



15 



2k 



12 



2 

5 
2 
1 



38 
98 

15 
2 



2k.Q 
64.1 
9.8 
1.3 



Total 



85 39 19 10 



153 



Percent 



55.6 25.5 12.4 6.5 



100.0 



Table 5.34 

Child's Arithmetic Expectation, Time 1 

1234 Total Percent 



Parent 
Arithmetic 
Expectation, 
Time 1 



1 

2 

3 

k 

Total 
Percent 



23 



12 



40 30 



8 12 



16 



38 



90 



23 



71 54 21 7 153 
46.4 35.3 13.7 4.6 



24.8 
58.8 
15.0 
1.3 



100.0 
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Table 3'33 

Child's Conduct Expectation, Time 1 

1 2 3 U Total Percent 



•Parent Conduct 
Expectation, 
Time 1 



2 

3 
k 



Ik 



k2 



10 



10 



26 



18 



1 

9 
2 



32 
95 
19 
2 



21.6 
6U.2 
12.8 
l.k 



Total 



67 40 29 



12 



1U8 



Percent 



1*5.3 27.0 19.6 8.1 



100.0 
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being 28.8* in reading, 37.9?^ in arithmetic, and 29.1^ in conduct (N « 153, 
153, and 148 pairs respectively) (Tables 5.33* 5.3^, 5.35). Purthennore 
there is little clustering around the main diagonals in the tables. 
Clustering would indicate closeness, if not exact, matching. Case by case * 
there is little correspondence between what parents expect and what 
children expect. 



Discrepances between Parents' Initial Expectations 
and the First Report Card 



There is little correspondence (no significant matching) between 
what children expect and the marks they receive on first report card in 
any of the three areas (see Tables 5.2, 5.3, 5.^). There is, however, 
a highly significant (z = 2.8o, p < .01) amount of matching between 
parents' e^ectations and children's marks in reading at the first report 
card. The accuracy in matching (55^) exceeds chance by a lOfji margin 

(Table 5.36). In other words, there is more matching than chance would 
predict. The same situation prevails in arithmetic (56^ matches vs. U?^ 
expected by chance; z = 3.00, p < ,01) and in conduct {3k% matches vs. U2^ 
expected by chance; z = 3.33, P < .01) for parents' expectations regarding 
first report card (ifebles 5.37, 5.38). 

As Just noted, parents' e^ectations show significant matching 
with children's marks at the middle of first grade (the first report 
card) in all three areas. What is happening in those cases where parents' 
expectations and childrens' marks differ? 

Most parents bet on a "B". Given the lack of variance in marks, 
it is hard for effects to display themselves (the power of tests is low) 
but nevertheless some interesting trends emerge. 

The marks in reading at the middle of grade 1 were worse than 
parents expected for 17% of the children and better for 285^. The 
tendency for non-matching expectations to underestimate marks reaches 
significance at the .05 level (Xi = U.19). Parents would probably prefer 
to err on the side of expecting too little than too much. At the end of 
the year the marks in reading showed approximately the same marginalfi and 
bore almost exactly the same relations to parents' expectations as marks 
obtained earlier. Tote that parents' expectations were only measured at 
about the middle of the year. Of those children who received A's in 
reading at the end of the year (N = 58), only 33^ of parents had looked 
for a better performanca. At this time, however, 33^ of children 
(compared to 28?& earlier) do better than their parents hoped and l8^ 
(compared to 17% earlier) do worse than their parents hoped. 

The marks in arithmetic at the middle of grade 1 were worse than 
parents ejcpected in 20% and better than expected in 23%, about an equal 
division. At the end of first grade, marks were worse them parenta 
expected in 12% and better in 3^% of the cases (Table 5.1+0). The degree 
of asymmetry f jr the "unmatched" cases at the end of grade 1 is highly 
significant (X^ = Ik, Oh, p < .01). There is some improvement during the 
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Table 3.36 



Parent Reading 
Expectation, 
Time 1 



2 
3 

Total 



Reading Mark, Time 1 
12 3 Total Percent 



19 
31 
2 



52 



17 
60 
6 
2 

87 



1 
8 
6 



15 



37 

99 
16 

2 

15i* 



2U.0 

6U.3 
10. U 

1.3 



Percent 33.8 56.5 9.7 



100.0 



Table 5.37 



Parent Arithmetic 
Expectation, 

Time 1 3 



ERIC 



Arithmetic Mark, Time 1 
12 3 Total Percent 



Ik 23 
17 67 
3 lU 



Total 3^ 10k 



2 
6 
6 
2 
16 



Percent 22.1 67.5 10. U 



5-^7 



39 
90 

23 
2 

15^ 



25.3 
58.U 
14.9 
1.3 



100.0 



Table 9. 38 



Parent Conduct 
Expectation, 
Time 1 



Total 
Percent 



Conduct Mark, Time 1 
12 3^ Total Percent 



17 



29 



13 



55 



13 



8 



U8 76 2h 1 
32.2 51.0 16.1 0.7 



32 
97 
18 
2 

IU9 



21.5 
65.1 
12.1 

1.3 



100.0 
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year in arithmetic marks, and this probably accounts for the smaller 
percentage of children who do worse than parents es^ected. Thus the 
picture at the end of the year duplicates almost exactly that seen for 
reading, differing to some small extent because of differing marking 
practices. 

In conduct at midyear 27^ of the children do better than their 
parents expect and 19^ do worse. At the end of the year 27% of the 
children again do better and 17% do worse (Table 5*^1)* In conduct 
children's performance is closer to what parents e3Q>ect than is true 
for either substantive area. Parents nay be cautious about pressuring 
children by esgpecting more than children can produce in reading or 
arithmetic. Iheir expectations may be more veridical in conduct where 
the child should have more direct control of his mark and where 
presumably direct parental pressure would be less damaging to performance. 



Discrepances between Parents' Expectations 
and Mark at End of First Grade 



Reading 

Marks at the end of first grade in reading, in contrast to midyear 
marks, do not agree significeuitly with parents' expectations (Table ^.39)* 
The percentage agreement is U9%, and chance expectancy is k3%» Significant 
agreement would require a percentage of 31% or more. Parents' expectations, 
unlike children's were sampled only once, sometime prior to the first 
report card. Parents' es^ectations for the end of the year, then, unlike 
children's, were not subject to modification by feedback during the year. 
(See the last portion of the preceeding section for further midyear- year 
end congparisons in all three areas.) 



Arithmetic 

Marks at the end of first grade in arithmetic (Table ^.Uo) agree very 
significantly with parents' initial expectations (z - 2.6?, p < .01), 
continuing the significant agreement noted at midyear. THie agreement, 
however, has dropped a few percentage points from to 33%» 



Conduct 

Marks at the end of first grade in conduct agree better with parents 
initial es^ectations than marks at midyear (:}6% agreement vs. 
agreement earlier). This year end agreement is highly significant 
(z s 3>^9> P < '01) as was the earlier agreement. 

In the three areas then, only for conduct do year end marks agree 
better with parents' midyear expectations than midyear marks. Both 
arithmetic and conduct murks rcmuin sif^lflcantly related to parents mid- 
year expectations while reading murks drop below significance. 

ERIC ^ ■ 



Table 5.39 

Reading Mark, Time 2 

12 3^ Total Percent 

1 19 15 3^ 23.0 

2 36 50 10 1 97 65.5 

Parent Reading 

Expectation. 3 3 9 4 16 10.8 

k 1 1 0.7 

Total 58 7** 15 1 ll*8 

Percent 39-2 50.0 10.1 O.7 100.0 



Parent 
Arithmetic 
Expectation, 
Time 1 



2 

3 
k 

Total 
Percent 



Table 5.^^) 

Arithmetic Mark, Time 2 
1 2 3 Total Percent 



21 
33 
3 



13 
52 
13 



57 78 



2 
2 
6 
2 
12 



38.5 52.7 8.1 0.7 



36 
87 
23 
2 

1U8 



2U.3 
58.8 

15.5 



100.0 
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Table 5M 



Parent Conduct 
Escpectatlon, 
Time 1 



2 

3 
k 



Conduct Hark, Time 2 
12 3^ 



19 
26 

3 



9 

55 
7 



Total k& 71 



1 

10 

6 
2 

19 



2 
2 



Total Percent 



29 
93 
18 
2 

lU2 



20.U 

65.5 
12.7 
l.U 



Percent 33*8 50.0 13. U 2.8 



100.0 



51 



How Parents' Esgpectations Compare in Different Areas 

There is a high degree of concordance between parents' es^ectations 
in reading and arithmetic— 63?t agreement (z = k.k3, P < .01). This is 
about Id percentage points above chance eigpectancy and so is significant 
in practical as well as statistical terms. (Children's concordance 
also exceeds chance significantly but by a smaller margin, 36^ vs. 26^.) 
Interestingly, while children's eaqpectations significantly favor 
reading over arithmetic, parents do not exhibit this bias. Of those 
parents whose guesses for arithmetic and reading do not agree, only 
about favor reading over arithmetic. 

The agreement between parents' guesses for conduct and arithmetic* 
like that for children's guesses, is not significant. 

The agreement between parents' guesses for conduct and reading is 
not significant either. This finding differs from that for children 
whose guesses for reading tend to match guesses for conduct. Also 
parents again do not favor one area over another when their guesses 
disagree. (Children looked for reading to be higher than conduct when 
their esqpectations were not matched.) 



How Dees Parents' Expectation-Mark Agreement Vary 
with the First Mark Received? 



Reading 

Using the same paradigm as that used earlier with children's 
expectations one can investigate how parents' forecasting ability 
varies across levels of marks received. Hkie marginal distribution for 
parents' expectations is A-2^, C-?0^6, and D-l^t. If no other 

force were at work, one would es^ect this set of guesses across each 
level of mark actually received. The distributions observed are given 
in Table ^.U2. 

Only in the case of B does the mode of any row fall in the "0" 
column. It looks as though all parents are playing safe-- guessing a 
"B" no matter what the ability level of their children is. All 
entries in the "0" column do indicate some over representation. The 
smallest over representation (69^ vs. 6^) occurs for the row based 
on the largest number of cases (8?). The parents whose children get 
C's appear more accurate but the number of cases (1^ in this row) is 
too small to warrant firm conclusions. 



Arithmetic 

A similar analysis can be applied to arithmetic. The marginal 
distribution for parents' expectations is A-25^, B-59^t, C-15^, D-1^. 
As Table 5.^3 shows, the mode for each row does not fall in the "0" 
column (as would happen if all parents accurately predicted marks), 

t - 
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Table 3*^2 

Parent Expectation-Mark Agreement by Level of 
Mark Received in Reading 



Parent Esgpectatlon Minus Mark; Reading, Time 1 



-3 


-2 


-1 


0 


+1 




•»-3 Total 


Percent 


1 














100^ 








19 


31 


2 


52 




2 












87 


100^6 


Reading 




17 


60 


8 


2 




Mark, 
















Time 1 ^ 


7* 




UO^ 












1 


8 


6 






15 




I* 
















Total 


1 


25 


85 


39 


I* 


151* 





Table 5M 

Parent Es^ectitlon-Mark Agreement by Level of 
Mark Received In Arithmetic 



Parent E^gpectatlon Minus Mark; Arithmetic* Time 1 



-3 


-2 


-1 


0 


+1 


+2 


+3 Total 


Percent 


1 






hli 


50^ 


9?t 


3U 


lOO^t 








Ih 


17 


3 








22^ 








lOU 


lOO^t 


Arithmetic ^ 




23 


67 


Ih 






Mark, 
















Time 1 ^ 


12^ 


3^ 


38^t 








LOO^t 




2 


6 


6 


2 




16 


















Total 


2 


29 


87 


33 


3 


19^ 
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Table ^.UU 

Parent Escpectatlon-Mark Agreement by Level of 
Mark Received in Conduct 



1 



Parent Eb^pectatlon Minus Mark; Conduct, Time 1 

-3 -2 -1 0 +1 +2 +3 Total Percent 

3^ 6oji % 100?t 

17 29 2 U8 



2 17^ 72^ 2^t lOQi 

Conduct 13 55 7 1 76 

Mark, 

Time 1 - df, 3^% 33% % 100^ 

^ 2 13 8 1 2lf 

u 100^6 100^ 

1 

« 

Total 2 27 80 37 3 1U9 
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but there is some movement in this direction for all ability levels. 
Again, while the pattern of over- representation in the "0" column is 
consistent, the smallest over- representation (6U^ vs. ^9^) appears in 
the row with the most cases (lOU). Hhe third row where the most 
interesting cases appear cannot shed much light because of the small 
number of cases (l6^. Parents seem a little less optimistic about 
arithmetic. 



Conduct 

For conduct marks the marginal distribution of parents' e^ectations 
is A-22^, C-12^, D-1^. The agreement between parents' e^gpectations 

and marks received is broken down by level of mark in Table ^.UU. 
Although for the reasons outlined earlier one might eicpect the closest 
agreement for conduct, as can be seen in Table there is only a 

slight departure from what would be expected under the assus^tion that 
the child's actual performance level has no effect. The magnitude of 
the over representation of the column headed "0" closely approximates 
the over representation that occurs for arithmetic (about 10^ on the 
average). The case bases, however, are more uniform than for arithmetic. 

To sum up: While parents do significantly better at forecasting 
children's marks than would be expected by chance the overall level of 
exact matching of forecasts and narks remains moderate. Exact matches 
occur in ^7^, and of the cases for reading, arithmetic, and 
conduct respectively. Is it parents of high, moderate or low ability 
students who are providing this extra accuracy in forecasting? In all 
three areas (reading, arithmetic, and conduct) parents exactly forecast 
children's marks more often than would be expected by chance, for children 
of all ability levels . In all three areas the smallest increment above 
chance levels occurs for the moderate ability students. Larger increments 
above chance occur for both the high and low ability students. The last 
two observations, while consistent across areas, may not attain 
statistical significance due to the generally smaller N's involved. For 
all levels of child ability the most frequently forecasted mark remains 
a "B" or "2". 



How Does Parents' Midyear Expectation-Mark Agreement 
Vary with the Mark Received at the End of First Grade? 



Tables relating the midyear discrepance between parents' expecta- 
tions and marks to year end marks are Tables 3.U3, 3.U6, and 5'h7. If a 
single causal relationship existed between parent midyear expectations 
and year end marks one would expect an over representation on the minor 
diagonal of these tables. While a slight over representation is present 
In the cell representing a year end mark of "2" and the "Some" midyear 
child performance as parents expected for all these tables, the most 
notable cells are those in the upper left corners of the tables. The 
size of these upper left cells Is anomalous. Why should children who 
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Tftble 3M 



Reading Mark, Time 2 



Child Old 
B«S.W Bian 
Parent Expected 
at Midyear 





1 


2 


3 




Better 


2k 


Ik 


3 


kl 


Same 


26 


1*5 


10 


81 


Worse 


8 


11* 


2 


2k 




58 


73 


15 


Ike 



Table 3M 



Arithmetic Mark, Time 2 
12 3 

Child Did 

B,S,W Than 
Parent Expected 
at Midyear 

57 77 12 1U6 



Better 17 12 U 33 

Same 29 50 5 84 

Worse 11 15 3 29 



Table 5M 



Child Did 

B,S.W Than 
Parent Expected 
at Midyear 





Conduct Mark, 


Time 2 






1 


2 


3 




Better 


20 


11 


k 


35 


Same 


22 


kk 


10 


76 


Worse 


6 


16 


5 


27 




kQ 


71 


19 


138 
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did better than their parents expected them to do at midyear get the 
highest mark possible at year end? Ihe lower parental esgpectations 
should produce some lower mark. A X test shows that only in reculing 
and conduct are children who received a "1" at year end significantly 
more likely to have had parents who e3Q>ected less — rather than more— 
in terms of their performance at midyear. (The respective Xi values 
are 7*03» P < .01; 0,33, N.S. ; and 6,03» P < .0^.) . 

The over representation can be understood in light of two other 
fcusts: First, if a child received a "1" at midyear he was very likely 
to also receive a "1" at year end. Diis occurs for 72%, ^3$ and 82^ 
of the cases for reading, arithmetic and conduct respectively. And» 
second, parents' midyear expectations matched the children's high 
performance only about one half of the time. At midyear, of the 
children who got I's the percentage of those whose parents e:Q>ected I's 
are ^1^, 36^ and ^3^ respectively for reading, arithmetic and conduct. 
Almost all the remainder of these parents had e:q)ected 2's. 

These two conditions are enough to insure cm over representation 
of the cells in question. This high stability of I's Insures that the 
roughly even split between parents e^^ecting the same or a better mark 
(with few expecting a worse mark) displays itself at year end as well 
as at midyear. Hie stability of I's itself might be taken as a sign of 
the causal ineffectiveness of parental expectations for those who 
received I's but such an argument would have to assume parents' 
expectations behaved quite differently than children's expectations in 
response to feedback. Remember, children's expectations rose sharply 
following positive feedback but did not drop nearly as much following 
negative feedback. If parents' expectations behaved in a similar 
manner, parents of children who received a "1" would come to expect a 
"1", removing any "downward force" of parental expectations on those who 
received I's at midyear. Unfortunately data to test the response of 
parental expectations to midyear mark feedback is not presently 
available. 



How Does Parents' Expectation-Mark Agreement 
Vary with Parent Expectation Level? 



As noted earlier there is a significant (above chance) tendency 
for parents' midyear expectations to exactly match their child's mid- 
year mark. Is this above chance matching evenly distributed among 
parents independently of the level of performance they e^ect from 
their child? 



Reading 

The pattern of cases we would have expected for reading, given 
chance matching between parents expectations and child's midyear marks 
is presented in the following table: 



Hypothetical Chance Distribution of Parent Expectation by 
Parent Expectation-Mark Agreement* Time 1 



Parent Expectation Minus Mark; Reading, Time 1 
-3 -2 -1 0 +1 +2 +3 



Parent E^^ecta- 2 
tion, Reading, . 
Time 1 ^ 



k 



1 



10 56 3^ 

10 56 3^ 

10 56 3h 

10 56 3h 



10CfK> 
100% 
100^ 
100?6 



This is arrieved at by placing the marginal distribution of children's 
marks (lO^t-3's, 56?^2's, and 3'+^-l's) in the appropriate section of each 
row. Given that parents show above chance levels of exact matching we 
would e^ect an over representation of the column headed "0" discrepance. 
The question here, however, is whether this over representation is stronger 
for those parents expecting any particular mark. Table 3,kB indicates 
the expected over representation of the column headed "0" occur fi at all 
expectation levels, but this over representation seems to be stronger 
for those e3a>ectlng either above or below average marks. Those parents 
expecting I's exceed chance expectation by 1? percentage points and 
those parents expecting 3's exceed chance by 28 percentage points while 
those expecting 2'8— in average mark—exceed chance by only 5 percentage 
points. (The row containing parents expecting l|'s is omitted due to 
insufficient cases.) 



Arithmetic 

In arithmetic the marginal mark distribution is 10^3 's, 66^2*8 and 
22^1 's. In coi^paring Table 5.^9 with a hypothetical table as constructed 
for reading— except using the arithmetic marginal percentages— we see 
that again the zerc discrepance column is over represented at all levels 
of parental expectation but that this over representation is largest for 
those expec^j^ both above and below average marks. Those parents 
expecting I's exceed chance levels by ik percentage points; those 
expecting 3's exceed chance by l6 percentage points; while those expecting 
2's exceed chance levels by only 6 percentage points. (l|'s &re omitted 
due to insufficient cases.) 



In conduct the mark distribution is lull's, 16^3 's, 51^2 's, and 
32^1 's. Table 5*50 indicates the hypothetical "0" discrepance column 
values are again exceeded at all levels of parents' expectations and that 
those parents holding above or below average expectations show the greatest 
excess. The excess for those expesting ]'s is 21 percentage points, 3's 
is 26 percentage points and 2's is only 6 percentage points. (4's are 
omitted due to insufficient cases.) 



Conduct 
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Table 

Parent Expectation-Mark Agreement by Level of 
Mark Esgpected by Parents in Reading 



Parent Expectation Minus Mark; Reading, Time 1 

-3 -2 -1 0 +1 +2 +3 Total Percent 

1 3% hei, 31% 100% 

1 17 19 37 

Parent 2 ^ 31^ 

t&renz d 8 6o U 99 

Expectation, o ou 

Reading, 
Time 1 3 



38% 50% 13% 100% 
6 8 2 16 



100% 100% 
2 2 



U 

Total 1 25 85 39 ^ 15^ 



Table 5*^9 

Parent Expectation-Mark Agreement by Level of 
Mark Expected by Parents in Arithmetic 



Parent Expectation Minus Mark; Arithmetic, Time 1 



-3 


-2 


-1 


0 


+1 


+2 


+3 Total 


1 


5% 
2 


59% 
P3 


36% 
lU 






39 


Parent 2 
Expectation, 
Arithmetic, 
Time 1 3 




?% 
6 


7H 
67 

26% 
6 


19% 
17 

6\% 


13% 
3 


90 
23 


1. 








100% 
2 




2 


Total 


2 


29 


87 


33 


3 


15U 



100% 
100% 
100% 
100% 
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Table 5.50 

Parent ExpectatlonoMark Agreement by Level of 
Mark Expected by Parents in Conduct 



Parent Expectation Minus Mark; Conduct, Time 1 



-3 -2 -1 0 +1 +2 +3 Total Percent 

1 ^ hvio 33% 100* 

2 13 17 32 

Parent 2 57* 30* 100* 

Expectation, ^3 55 29 97 

Conduct, , , . 

Time 1 3 6* UU* 39* 11* 100* 

1 8 7 2 18 

k 50* 50* 100* 

11 2 

Total 2 27 8o 37 3 11*9 



o 
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Summary 



In all three areas, reading, arithmetic and conduct, there is some 
above-chance matching between parents' expectations and their children's 
midyear mark, at all levels of parental expectation . Those parents , 
however, who hold either above or below average expectations for their 
children exceed chance to a greater degree than those parents who only 
expect an average performance ! This observation may mean either a) that 
the degree of causal efficacy of parental expectation differs by 
expectation level (which, for example, mi^t. be made plausible by 
arguments concerning "salience of parental expectations for the children") 
or b) that parents with either above or below average children are more 
likely to have come to expect this above or below average performance 
from their children than parents of average children are to ha/e come to 
expect an average performance from their children. 

While no significance tests for the differences in excesses of 
exact matching at the different parental expectation levels were done, 
the consistent pattern indicates these observations are worthy of 
attention. 



The Effects of Parents' Expectations on Changes in 
Children over the First Grade Year 



Effects on Marks 



Reading 

Changes in reading marks over the first-grade year as a function of 
parents' expectations vs. first marks are summarized in Table 3.31. As 
earlier noted, the sum of the elements on the minor diagonal gives a 
measure of the extent to which children's marks move in a direction 
consistent with parents' expectations. The movement is highly significant 
(z = 2.52, p < .01) . Looking at the four comers of the table one sees 
that the movement consistent with parents' expectations is ingpressive in 
both directions while that inconsistent is almost invisible (1 in each 
cell) . 

It is of some interest to compare Table 5.51 with the similar table 
(5.23) for children. The four corner cells there account for 50 cases, 
of which approximately two- thirds (36) move to increase agreement. In 
Table 5.51 the four comer cells account for 27 changes and of these 93^ 
move consistently. What Is perhaps most interesting is the size of the 
cell represent inK children whosp marks move down to increase agreement, 
that is, those who earlier did better than their parents expected. 

Arithmetic 

Changes in arithmetic marks over the first-grade year as a function 
of parents' expectations are summarized in Table 5.52. Here also the 



Table 3*31 



Change In Reading Mark, Tl to T2 







UP 


Same 


Down 






Better 


1 


28 


12 


1*1 


Child Previously 












Did B.S.W 


Same 


18 


50 


13 


81 


Than Parent 












E^^ected in 


Worse 


13 


11 


1 


25 


Reading 
















32 


89 


26 


lU? 



Child Previously 
Did B>S,W Than 
Parent Expected 
in Arithmetic 



Table 3*32 

Change in Arithmetic Mark, Tl to T2 





up 


Scune 


Down 




Better 


2 


26 


6 


3U 


Same ^ 


19 


63 


2 


8U 


Worse 


lU 


lU 


1 


29 




35 


103 


9 


lU? 



Table 5.53 
Change in Conduct Mark, Tl to T2 







Up 


Same 


Down 




Child Previously 


Better 


1 


2U 


11 


36 


DJd B.S.W Than 
Parent Expected 






58 






Same 


12 


7 


77 


in Conduct 














Worse 


15 


13 


1 


29 






28 


95 


19 


lU2 



movement toward agreement, tested by examining the sum of the elements 
on the minor diagonal, is highly significant {z « 3.21, p < .01). If 
the parent e3G>ected the child to perform more poorly than was the case 
at midyear, there is a trend for that expectation to be realized over 
the year— 18^ of those children have marks that move down. These 
cases account for two thirds of the instances where marks do move 
down over the year. 



Conduct 

Table 3-53 summarizes changes in conduct marks over the first-grade 
year in terms of whether first marks received were consistent with 
parents' expectations. Again the movement toward consistency is hi^ly 
significant (z = U.50, p < .01) and a large proportion of those whose 
marks move down (58?t)are recruited from the group who performed better 
than their parents earlier expected. 

A comparison with the similar table based on children's expectations 
for themselves (Table 5.29) again suggests that the press to make 
performance coincide with expectations seems slightly stronger for 
parents' expectations than for children's espectations. 

In sum , children's year end marks in all three areas sliow 
significant trends toward agreeing with parents' midyear expectations. 
The over representation on the minor diagonal is consistent with a 
causal hypothesis viewing parents' expectations as the independent 
variable. 



Effects on Expectations 



Reading, Arithmetic, Conduct 

There Is some tendency over the first grade year for children's 
expectations to rise a little In all three performance areas. This 
change In children's expectations does not appear to be related to the 
child's midyear performance level evaluated against parents' expectations. 
If a parent expected less than his child delivered. In all three areas 
(see Tables 5.^) 5.55) and 5.56) there may be a slight tendency for the 
child's expectations to Improve . The top left comer cell In all three 
tables Is larger than the other three corner cells, but In no case Is 
this association strong enough to attain significance. This seemingly 
counterintuitive observation might have arisen as follows: For the 
children who did better than their parents expected It Is quite reasonable 
to expect a large proportion of these children to have done better than 
they themselves expected. That Is, a large majority of parents expect 
a B (or "2"), therefore If the child's performance exceeded this the 
child probably received an A (or "1") at midyear, which In turn would 
suggest a fairly high likelihood tnat the child's performance would 
exceed his own expectations. If the child expected anything other than 
the top mark possible this would besthe.case. As noted in an earlier 
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Child Previously 
Did B,S,W Than 
Parent E:a>ected 
in Reading (Tl) 



Tftble 3*3^ 

Child's Reading E^gpectation Change, Tl to T2 





Up 


Same 


Down 




Better 


11 


2k 


6 


kl 


Same 


2k 


ko 


16 


80 


Worse 


7 


10 


8 


25 




U2 


7k 


30 


1U6 



Child Previously 
Did B.S,W Than 
Parent Expected 
in Arithmetic (Tl) 



Table 5.55 

Child's Arithmetic Escpectation Change, Tl to T2 





up 


Same 


Dovm 




Better 


10 


17 


6 


33 


Seune 


25 


38 


21 


8h 


Worse 


k 


17 


8 


29 




39 


72 


35 


1U6 
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Child Previously 
Did B,S,W Than 
Parent Expected 
in Conduct (Tl) 



Table 5.56 

Child's Conduct Expectation Change, Tl to T2 





up 


Same 


Dovm 




Better 


Ik 


17 


7 


38 


Same 


31 


28 


16 


75 


Worse 


11 


10 


7 


28 




56 


55 


30 


Ikl 
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section on feedback, If a child performed better than he expected, there 
was a strong tendency for his expectations to rise— which is what 
probably happens to the children being discussed here. 

The "counterintuitive" observation has now been e^lained as follows: 
If a child did better than his parent escpected his expectations may tend 
to rise because the child is likely to also have performed better than 
he himself escpected— a condition previously shown to produce a rise in 
children's expectations. The cause of the expectation rise is then seen 
as the consequence of positive mark feedback for the child, and not 
something connected with the "relatively lower" parental expectation* 



Sex Differences 



Sex and Marks 

The sex differences seen for individual cohorts are generally borne 
out more strongly in the combined data. More boys get the poorest grade 
(C) in reading in the middle of first grade than girls. The same is 
true for conduct. The odds are roughly 2:1 in both areas that a boy 
will get the lowest mark. The number of low marks actually issued in 
both areas is small, however, amounting to '11^ of reading marks and 
17% of conduct marks. In arithmetic low marks are about equally 
distributed between the two sexes at midyear. 

By the end of first grade there is an even more marked tendency for 
boys to account for the poor marks in reading and conduct, while poor 
marks in arithmetic continue to be assigned equally to both sexes. The 
changes over the year by sex can be seen in Table 

At midyear the tendency for boys to receive the poorest marks in 
reading is not strong enough to be significant. By the end of the year, 
however, Ql% of the poorest marks awarded in reading go to boys and this 
is a highly significant sex-related discrepance (z = 2.62, p < .01). The 
same pattern exists for conduct. By the end of the year, 8% of the 
poorest marks awarded go to boys, again a highly significant disparity by 
sex (z = 3.U6, p < .01). 

Turning now to the other extreme of the distributions of marks for 
reading, one can examine the awarding of A's: At midyear 32^ of the 
boys get A's and 37% of the girls get A's, not significantly different 
(N = 96 for boys, N = 82 for girls). But over the year the picture 
changes. Ten more A's are awarded at the end of first grade than at 
midyear, and 9 of the 10 go to girls. Even with such a small number, 
there is a significant sex related difference (z = 2.21, p < .0^). 
About the same number of C's are awarded on both occasions but, as 
already shown, boys receive a significantly larger number of C's them 
girls at the end of the year. 
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Table 5.57 
Students Getting C or Less by Sex 



Reading 
Midyear Year End 



Boys 
Girls 

Total 



Ik 
6 

20 



17 
k 

21 



Arithmetic 



Midyear Year End 



11 
11 

22 



8 
7 

15 



Conduct 



Midyear Year End 



20 
11 

31 



23 

k 

27 
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The conclusion is apparent-- there is very little difference in 
marginal distribution of marks in reading at the end of the year coQpared 
to midyear and almost all of it is accounted for by the awarding of a 
few more A's. In all cases but one these A's are awarded to girls. 

For conduct 7 more A's are awarded at the end of the year than 
were awarded in the middle. Here^ however , boys are the recipients ^ 
with 6 of the 7 "new" A's going to boys. So in spite of the increased 
propensity for the poorest marks in conduct to go to boys, there is 
also a push for boys' marks in conduct to move up. 

In arithmetic despite stability at the low end of the distribution 
of marks ) there is relatively leurge linprovement in the average mark 
given- -32 more A's are awarded at the end of the year than at midyear. 
Nineteen of these "new" A's go to boys, and 13 go to girls, an association 
with sex that is not significant. There is a ^3:^7 split of A's in favor 
of boys at midyear and this split widens to 36:kk at the end of the year, 
but the division is not significantly different from a ^0:^ split at 
either time. The trend, however, can be contrasted with that in reading 
where most of the Improvement in marks was noted for girls. 



Sex and Children's Expectations 

What of the child himself in terms of his sex vs. his expectations 
over the first-grade year? At midyear, when expectations of children are 
sampled for the first time, females may be a little more cautious than 
males in both reading and arithmetic. Eight per cent of girls vs. ^ of 
boys expect the lowest mark (D) In reading (which none actuedly receives). 
Two per cent of boys and 8^ of girls e;a>ect the lowest mark in arithmetic. 
By the end of the first year only 3% expect C's or O's in reading, divided 
about equally between boys and girls (compared to 19^ and 17% respectively 
at midyear). More expect low marks In arithmetic— lU^ expect C's or D's 
and again the sex division is about equal. 

But the decreases in expectations for A's over the year are sex- 
related. One more girl expects an A in reading at the end of the year 
and 8 fewer boys do. There is thus a very clear parallel between 
children's expectations and the teachers' marking behavior for, as was 
seen earlier, the increased number of A's awarded in reading go almost 
entirely to girls. 

Just the opposite tendency may be seen In arithmetic expectations 
for A's by sex. Eight more boys expect an A at the end of the year than 
expected an A at midyear, and 3 fewer girls look for an A at the end. 
Again children's expectations are veridical with teachers' marking 
behavior for, as seen earlier, "new" A's in arithmetic are somewhat 
likelier to go to boys. 

For conduct more boys— 33^ vs. 23^ (N = 100, and N •- 86 respectively) -- 
than girls expect the two lowest grades at midyear. Also more girls than 
boys expect an A {^2% vs. hl%) , At the end of the year a large majority 
of girls (6%, N = 8i*) expect an A, whereas h3% of the boys (N « ICQ) 



ERIC 



expect an A. This difference is significant (Xi « 9.26, p < .01). Two 
more boys and 10 more girls look for an A at year end compared to mid- 
year. However, as seen earlier, of 7 ''new" A's in conduct, 6 go to boys. 
It appears that girls' expectations are not veridical in the case of 
conduct. 



Sex and Parent Expectations 

Of those parents (19) who expect their children to do poorly in 
reading (C or D), 6&f, expect it of a son. The sex difference is not 
significant with a sajnple this size. ■ Of those parents (2?) who expect 
their children to do poorly in arithmetic, a majority (56^) expect it 
of girls. In conduct 21 parents expect their children to do poorly and 
71^ of them expect it of a son. This last trend is significant beyond 
the 10^ level (z = 1.7^). To sum up, although parents' expectations 
are in the culturally predicted direction for reading and arithmetic, 
trends here are not strong enough to be statistically significant. 

The marginals for parents' expectations and the marginals for 
marks obtained in reading are quite similar in data for combined cohorts 
or in each cohort separately. Coiqpared with the 12^ of parents who look 
for poor marks in reading, 10^ of all children actually get poor marks. 
But of the 12 boys whom parents expect to get low reading marks, only k 
actually do. Two boys whose parents expect them to get "k" get "2", and 
six boys whose parents expect them to get "3" get "I's" or "2's". On the 
other hand, 7 boys do get "3's" whose parents look for higher marks. 
There is about the same variance in prediction for girls, but only k 
girls get a mark of "3" (none gets "U"). 

In arithmetic 10 (12^) parents expect their sons to do poorly, and 
only 2 boys confirm these expectations. In arithmetic 13 (21^) parents 
expect their daughters to do poorly and 6 daughters confirm that. The 
numbers are small, but three tendencies exist: parents are more likely 
to expect girls than boys to do poorly in arithmetic; of those children 
whose parents look for poor performance, the large majority do well, but, 
if there is a tendency to conform with parents' expectations, it may be 
for girls. In conduct parents' expectations are more nearly fulfilled, 
for 6 of Ik boys whose parents expect them to get low marks actually do 
so. Of the boys who expect low marks in reading (20^, N = 83), about 
one-fourth have expectations consistent with their parents. A convenient 
summary of the picture for boys whose parents have low expectations is in 
the upper half of Table ^.^8. What is noticeable are (l) the variability 
of children's expectations in the face of low parental hopes, and (2) the 
high percentage (67^) who get A's and B's for actual marks when parents 
expect low marks. Of boys whom parents expect to do poorly in reading, 
one-third {k out of 12) get a C and 5 expect to do poorly. 

The picture can also be viewed from another perspective. For 
children with low expectations, what are parents' expectations and actual 
marksv Of boys who themselves expect to do poorly in reading, relatively 
few (3 out of 18) actually do poorly but 5 nave parents who expect them 
to do poorly. The data are summarized in the lower part of Table 5.58. 
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Table 3*38 
Boys' Reading 



Parents Escpect Child Expecta* Child Gets* 

A - 7 A - 2 

C-10 C-2 B-U 

D - 1 C - U 

D-2 C-1 B-2 

D - 1 



12 12 12 

* The letter grades and frequencies reported for child's expectation 
and marks are provided within the specified categories of parent 
expectations . 



Child Expects Parent Expects* Child Gets* 

A - 3 A - 2 

B - 6 B - 9 

C-2 C-2 
D - 1 



C - 13 



D - 5 



A - 


1 


A - 2 


B - 


2 


B-2 


C - 


1 


C-1 


D - 


1 





18 17 18 

* The letter grades and frequencies reported for the parent's expecta- 
tion and child's marks are provided within the specified categories 
of child expectations . 
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Table 5.59 
Girls' Arithmetic 



Parents Expect Child Expects* Child Gets* 

A - 5 A - 1 

C-lU B-7 B-8 

C - 1 C - 5 

D-1 D-1 C-1 



15 Ik 15 

* The letter grades and frequencies reported for child's expectation 
and marks are provided within the specified categories of parent 
expectations . 



Child Expects Parents Expect* Child Gets* 

A - 1 A - 2 

C - 16 B - 10 B - 11 

C-1 C - 3 

A - 1 

D-6 B-3 B-4 

D-1 C - 2 



22 17 22 

* The letter grades and frequencies reported for the parent's expecta- 
tion and child's marks are provided within the specified categories 
of child expectations . 
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Again the numbers at this time are too small to Justify conclusions but 
If a parent expects a child to do poorly the child's expectations agree 
in k2ff> of the cases, while if a child expects to do poorly the parent is 
less likely to agree (29^). In both cases a large majority of children 
actually do well"67^ of children whose parents have low hopes get A's 
or B's, while 83^ of children who themselves have low hopes get A's or 
B's. 

Girls whose parents have low expectations for arithmetic do not 
themselves hold low expectations but Uo^ do get a low grade (almost the 
same kind of relation as for boys' reading). However, 6o^ get a good 
grade. If a girl expects to do poorly, her parents are not likely to 
agree, and only 23% actually get low grades. Again if a parent has 
low hopes it is more predictive than a child's having low hopes, but 
both are more often wrong than right. 

This general pattern persists throughout the entire set of tables 
of this type for reading, arithmetic and conduct, as well as for both 
sexes. 
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CHAPTER 6 

RESULTS FOR ONE FIRST-GRADE COHORT (1972-73) 
LOfER-CLASS INTEGRATED SCHOOL {kOfjl, WHITE) 



Tabular Summary 



Means and standard deviations for variables used In subsequent 
analyses and cross- tabulations are given In Table 6.1. The mean IP 
(Primary Mental Abilities) for 130 children for whom IQ data were 
avallGible from school records (88^) Is lOk.63 with a standard 
deviation of 22.6. The data are given for the entire cohort and also 
separately for white students and black students. Hhe overview will be 
given first for the entire cohort and then separately for its white and 
black portions. 



Entire Cohort 

(1) In the lower-class integrated school parents' expectations in 
all three areas are much lower than children's expectations (significant 
beyond the .01 level for reading, arithmetic, and conduct). The mean 
expectations of parents and children, respectively are 2.33 vs. 1.28 in 
reading, 2.23 vs. 1.7*^ in arithmetic, emd I.89 vs. 1.07 in conduct. 

Esgpectatlons were obtained from only jk'fi of the pcurents, even though 
Interviewers went repeatedly to the homes of lower-class children to 
contact the parents. In advance of the interviewer's call a note was 
sent home from school with children esgplaining that an Interviewer would 
come to the home a day or two hence and also what the purpose of the 
interview was. As many as three call- backs were used to try to reach 
lower-claf^x parents and care was taken to send a black interviewer to see 
black parents and a white Interviewer to see white parents. 

(2) Before their first report card children are exceedingly optimistic 
in reading and conduct. Their expectations in reading and conduct are 
significantly higher than their e3Q>ectations in arithmetic (p < .01 for 
both paired t- tests). 

(3) Children's average expectations in all three areas remain 
remarkably constant over the first-grade year. The largest difference, 
that for conduct, is only 0.I6 of a grade point. 

(U) Children's marks on the first report card are low. (The reading 
marks are particularly hard to evaluate at this time because teachers use 
different marking schemes). The arithmetic marks average 3.03 and conduct 
marks average 1.93> 
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(^) Children's marks Inprove significantly in all three areas over 
the first-grade year (p < .01 for all three paired t-tests). Wrovement 
is largest in reading (0.^6)) next largest in arithmetic (0.26)^ and 
least in conduct (0.21). 



White (N = 57) and Black (N = 86) Children in Cohort 

(1) White parents and black parents have comparable expectations 
for their first-grade children except in conduct where white parents look 
for a significantly better mark (p < .01). Expectations were obtained 
from 77^ of white children's parents and from 12% of black children's 
parents. (The race of 5 children could not be ascertained.) Parents' 
expectations are uniformly lower than children's within each racial 
group. 

(2) Before their first report card children of both races are hi^ly 
optimistic about their forthcoming marks in reading and in conduct. Both 
races are noticeably less optimistic about marks to come in arithmetic^ 
with whites on the average estimating an arithmetic mark 0.18 units lower 
than blacks (difference not significant). 

(3) Children's average expectations over the year are remarkably 
constant no matter what the race of the child. 

(U) Average reading marks are very close in the two races (B-l'^ vs. 
3.16). Small differences (not significant) favor whites in arithmetic 
(2.90 vs. 3.09) and in conduct (1.77 vs. 2.03) (first report cardK 

(5) Children's marks over the first grade year improve more for 
whites theui for blacks in both reading and arithmetic (for whites^ 
0.71 and 0.31 respectively, and for blacks 0.U7 and 0.20 respectively). 
In conduct the changes are small (0<l8 and 0.22) and close to one another. 



Discrepances Between Initial E^gpectations 
and the First Report Card 



First, a word is needed about how marks are assigned in general, and 
in particular for reading. The report card which children take home 
contains the explanation below: '*Your child's progress is being measured 
in terms of his progress in reaching standards or levels that are 
considered appropriate for his age or years in school." Marking is thus 
done with respect to grade level performance, comparing the child's 
progress with average progress of children his age. It is Important to 
note that the child's ability is not weighed in« so a child with low 
ability is bejnfr, moaaurcd against the same performance scale us the child 
with high ability, (in the suburban school children were graded in terms 
of how their perform/mce compared bo their expected performance adjusted 
tor ability level. ) 



Three different schemes were used for reporting marks in reading for 
this cohort: the number of Distar units coi^pleted, or the primer level 
(two series) of the book the child has completed. In the case of primer 
level, two different reading series were used. Since these series use 
the same letters, but the letters designate different levels of primers 
in the two series, the system is confusing. "P" in one system, for 
example, means a higher mark than "P" in the other. 

To keep initial marks consistent with later marks, they were changed 
to a scale like that used for other subjects. 



Reading 

Close to half of the marks given ik3%) in reading appear equivalent 
to a U. Most of the children (73^), however, e3g>ect the highest mark in 
reading. Only 3% expect a C, and none expects lower. There is a marked 
contrast between the marginal distribution of children's expectations 
(strongly skewed toward the high end) and the marginal distribution of 
marks given' by the teachers (strongly skewed toward the low end). This 
difference leads to a highly significant asymmetry for children's 
expectations to exceed the marks received (X^ = 5U.0, p < .01). 

Since no marks equivalent to A's are given it may appear as if our 
recoding of letter marks to numerical marks is faulty and the whole 
numerical scale should have been shifted one position lower. This is 
not the case however. Both the teacher's comments on the students' 
report cards and the levels of student cong)etence normally associated with 
the letter grades involved, suggest the teachers did indeed view the 
students as displaying a range of competence markedly skewed toward the 
low end. Such a rescaling would also have placed this set of marks 
coinpletely out of line with those given at year end when the usual 
marking system was used. As it stands, not a single student has his 
performance exceed hib expectation in reading and only 10^ of the 
students were able to meet their expectations. (7^ would have been 
expected to do so by chance eJlone.) A large fraction of those who 
receive the lowest mark (21 out of 26 or Ql%) expect the highest. 



Arithmetic 

Most children expect a B (60^) in arithmetic. Ejcpectations in 
general are high— over 9% expect the top two grades. Fewer students 
have hopes and marks that match than one would expect by chance alone 
(16.7^ would be expected to match by chance but only 12i> of the cases 
actually do match). The child's mark is apt to be less than he expects. 
In fact 29^ of the children receive the lowest possible mark in arithmetic 
and 75i received the lower two^ marks. As with reading, in arithmetic 
Ihore is a marked asymmetry (Xj - 91.0'», p < .01), the children expecting 
better marks than they receive, of thooo whose mark falls to match, 99^ 
of the mlsnuiLchen fire of the type where the mark is lower than the child 
hoped. Again one note:; the large number of children (29^) who receive 
the lowest possible mark, compared to the small number expecting either 
of the two lowest marks 
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Table 6.2 

Reading Mark, Time 1 
1 2 3 U Total Percent 





1 


10 


13 


21 


kk 


73.3 


Reading 


2 


5 


5 


3 


13 


21.7 


Expectation , 














Time 1 


3 




1 


2 


3 


5.0 




U 














Total 


15 


19 


26 


60* 






Percent 


25.0 


31.7 


U3.3 




100.0 



* One class Is missing reading marks. 



Table 6.3 
Arithmetic Mark, Time 1 







1 


2 


3 


k 


Total 


Percei 




1 


1 


16 


12 


9 


38 


35.2 


Arithmetic 


2 




9 


37 


19 


65 


60.2 


Expectation 


» 














Time 1 


3 




1 


1 


1 


3 


2.8 




k 
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Table 6.U 

Conduct Mark, Tine 1 
1 2 3 If Total Percent 

1 22 67 11 100 92.6 

Conduct 2 6 2 8 7.4 

Esgpectation, 

Time 1 3 

k 

Total 22 73 13 108 
Percent 20. U 67.6 12,0 100.0 
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Conduct 



In conduct we find the only area in which teachers give out substan- 
tial numbers of the highest possible marks* Twenty per cent of the 
students are awarded I's in conduct while 68^ receive 2's and 22% receive 
3's. Since 93^ of the students expect I's. however, the significant 
asymmetry between es^ectations and marks (X = 80.00, p < .01) remains. 
The matching between marks and expectations (26^) is about what one would 
expect by chance {2k%), given the distributions involved. Hhere remains 
a strong tendency for children to receive a mark lower theui the one 
they expect. 

In summary the relationship between children's escpectations and 
marks at the middle of grade one strongly reflects the pronounced 
positive skew in the expectation distribution. Ohis produces an 
asymmetry in the relationship between marks and expectations in all 
three areas. The children expect a higher mark than they receive (or 
conversely receive a lower mark than they expect). This asymmetry is 
enhanced in the cases of reading and eu^ithmetic by the fact that the 
mark distributions in these eureas eu:e negatively skewed. 

In general students' expectations match the mark they receive with 
about the same frequency one would expect on the basis of chance. The 
matches slightly exceed those e^qpected by chance in reading and conduct 
but the reverse is true for arithmetic. 



Discrepances Between Esqpectations and Report Cards 
at the End of First Grade 



Readi ng 

The agreement between children's expectations for reading and the 
mark they receive at the end of first grade is slightly (but not 
significantly) below what would be expected by chance (11^ vs. lU^). A 
relatively small proportion of the cases is involved since the children's 
reading es^pectations are highly skewed toward high expectations (8l^ 
expect a 1; mean » 1.32) while the mark distribution is more nearly 
symmetrical (mean » 2.39). Of the 89^ of the cases where the student's 
expectation does not match his nark, a significant majority, 92^ 
(Xj = 71.5, P < .001), expect a mark higher than the one they receive. 

Arithmetic 

Agreement between marks in arithmetic at the end of grade one and 
expectations at that time is almost exactly at the chance level {22.3% 
expected by chance, 22.6^ observed). As with reading, for EU'ithmetic 
generally higher expectations prevail (mean = 1.73) than marks attain 
(mean = 2.77) though the ejipectation distribution is not as markedly 
skewed as in the case of reading. Forty-three per cent of the children 
still expect a 1 in arithmetic. As In reading the preponderance of high 



Reading Mark, 
Time 2 
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Table 6.5 

Reading Expectation, Time 2 
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Arithmetic Expectation, Time 2 
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Table 6.7 

Conduct Expectation, Time 2 
^ 2 3 k Total Percent 
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expectations in arithmetic results in a significeuit majority of mis- 
matched cases (83^) expecting a higher mark than they receive (X^ - 39*11* 
p < .001). * 



Conduct 



In conduct both the mark and expectation marginals are skewed toward 
the high end, with the e:q)ectation marginal being slightly more skewed 
than the mark marginal. As a result of this similarity in marginals a 
greater proportion of matches are ejected {kk%) than were escpected for 
reading {lk%) and arithmetic (22^). The observed matches (U^) do not 
significantly exceed this chance level of matching. Again asymmetry is 
present. Of those whose conduct marks and expectations differ at year 
end, a significant majority (8U^) have higher expectations than marks 
(Xj = 29. 3U, p < .01). 

In sugary, at year end most of the observed relations between 
marks and escpectations result from the skew towards high e3Q>ei7t,ations 
on the expectation marginals. This results in significant asymmetry 
between marks and e3q)ectations (children generally holding higher 
esQJectations than the marks they receive) in all three areas. It also 
results in a relatively small proportion of matches being expected by 
chance alone (except for conduct where a skewed mark marginal increases 
the proportion expected). The chance levels of matching are not exceeded 
in any of the three areas . The general year end pattern of: asyimnetry 
due to skewed expectation marginals, small proportions of expected 
matches, and lack of significant deviation from expected matches, is the 
same as the general pattern observed between marks and expectations at 
midyear . 



. How Do Children's Esgpectatlons Change 
Over the Flrst-Grade Year 



Reading 

The marginals for reading esipectations sampled in the middle 
of the year and those sampled at the end are almost identical, with 
close to Q0% of children e:Q)ecting the highest mark possible on both 
occasions. There is no shift downward, for the same number move up as 
move down (I5). Since 79^ expect a "1" on both occasions, a high 
percentage of matches {62%) is expected by chance. The number of 
matches observed, 6%, is larger but not significantly larger. The lack 
of variance in both expectation distributions makes the question some- 
what trivial. 

Since about the same number of children get more optimistic as get 
more pessimistic, it appears that the marked discrepance between the 
child's initial expectation and the first mark he received has had little 
impact. The expectations of children remain very high—close to 8o^ expect 
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Table 6.8 

Reading Expectation, Time 2 
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Arithmetic 
E:Q>ectation, 
Time 1 
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Table 6.9 

Arithmetic Expectation, Time 2 
12 3k Total 
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Table 6.10 

Conduct Es^ectatlon, Time 2 
1234 Total Percent 

1 73 15 1 1 90 93.7 

Conduct 2 If 1 1 6 6.2 

Expectation, 

Time 1 3 

k 

Total 77 16 2 1 96 
Percent 80.2 I6.7 2.1 1.0 100.0 
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the highest grade.' The effect of feedback will be discussed In detail 
in subsequent sections. 



Arithmetic 

There Is a modest increase In the variance of arithmetic expectations 
over the year. On first inquiry over 95^6 of the children expect a "1" or 
a "2"; on second inquiry about expect those grades. On the other 
hand, 7 more children es^ect a "1" on second inquiry than did on first 
inquiry. The total number of persons shifting up exactly equals the 
total number shifting dovm (28 move in each direction) but 9 of those 
moving down register a decrease of 2 units. 

The number of matches, U2^, is almost exactly what one would 
predict by chance. The most noteworthy fact is that a very small number 
of children look for a mark below a B on first inquiry, whereas l6^ have 
lowered their hopes on second inquiry. 

Conduct 

There Is some mild movement toward lower e:q>ec tat ions for conduct 
grades, expectations moving from 1 to 2 over the year, but about 77^ have 
exactly the same es^pectatlon later in the year as they had earlier. 
Again, as earlier, the picture in conduct is largely uninteresting 
because such a large percentage of the children expect A's. 

To sum up: There is very little difference in the espectatlons at 
midyear and year end In either reading or arithmetic. A mild movement 
down in conduct is not interesting because overall expectations are 
still so high. 



How Do Expectations Change over the Ytar in Relation 
to Initial Expectation Level? 



Reading 

About 79^ had high expectations at midyear (I's) and almost the 
entire remainder expected 2's (17^). About equal numbers of children 
move down (Table 6.11) but of the 16 children with expectation 2 at 
midyear 7% (11) move up whereas of the 75 children with expectation 1 
at midyear only 1S% move down. The reader should note that movement 
from an expectation of a 1 can only be in a downward direction (i.e., to 
a 2, 3 or U) while movement from a 2 can be either upward (to a 1) or 
downward (to a 3 or U). 

The preponderant trend then is that a very high percentage have 
extraordinarily high expectations to start with and tend to keep them, 
while those with not- so-high initial expectations are very likely to 
Increase them ( p < .01). 
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Table 6.11 



Reading Expectations at Midyear 
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Table 6.12 
Arithmetic E^Qpectatlons at Midyear 
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58 
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Arithmetic 



Fewer children have very high e3qpectatlons in arithmetic at both 
times. Of those whose expectations change about as meuiy move up as move 
down over the year. Again note that children with an initial expectation 
of a "2" can move up or down. As in reading, a greater proportion of 
children with 2's initially change their expectations (mostly upward) 
compared to those who initially had I's but this difference is not 
significant (Table 6.12). 



Conduct 

In conduct expectations are so likely to be "1" to start with that 
it is not sensible to see how many stayed the same or changed as a function 
of initial expectations. Again, for those who had I's the only possible 
movement is downward movement, which in this Instance happened to be 
23% of the cases. 



How Marks Compare in Different Areas 



First Report Card 

Initial marks in reading and arithmetic show more consistency than 
would be e3Q)ected by chance. Although 31^ agreement would be predicted 
by chance, 33lo is observed (z - h,Ok, p < .01). Teachers then are rather 
consistent in the marks they assign in the two subjects. The average 
mark in arithmetic is 3*03 vs. 3*1^ in reading at midyear, slightly but 
not significantly higher. 

For reading and conduct the e^tected agreement of the two marks is 
2hi, while 31^ agreement is observed. Though not significant this difference 
is in the direction suggesting a consistent assignment of marks between 
the areas. 

The observed agreement between arithmetic marks axkd ^^onduct at mid- 
year is 22^, which is equivalent to that expected by chance (22%), There 
is no more consistency, then, between marks in arithmetic and conduct 
than would be expected by chance, and probably the same holds true for 
reading and conduct. It appears that teachers assess the behavior area 
separately from the substantive areas, yet assess the substantive areas 
similarly. 



Marks at End of First Grade 

At the end of the year the agreement between marks in arithmetic and 
reading has increased to ^3^, and this exceeds chance (29^) by an even 
great(T margin than at midyear (z - 7.6o, p < .001). As noted elsewhere, 
teachers at the end of the year give both more high and low marks in 
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Table 6.13 
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One class is missing Time 1 I?eading Marks. 



Table 6,lh 
Arithmetic Mark, Time 2 
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Table 6.13 

Conduct Mark, Time 2 
123 Total Percent 
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arithmetic compared to midyear (the standard deviation Increases from ,7k9 
to .969) and there Is a rise In average mark from 3. 03 to 2.77. In 
reading the Increase in marks is dramatic— from 3.15 to 2.59, with no 
change In variability. Ttie agreement between marks in reading and 
conduct Is what would be expected by chance alone, both the chance and 
observed matches being 2% of the cases (N » 127). 

The agreement between arithmetic and conduct at the end of the year, 
29^, exceeds expectation, 22^, and is significant at the .05 level 
(z = 2.21). 

By the end of the year, then, there is a noticeable increase in 
homogeneity in marks for reading and arithmetic, and for the first time 
a significantly greater than chance matching between arithmetic and 
conduct. Reading marks remain related to conduct marks only at a level 
expected by chance. 



How Do Marks Change Over the Flrst-Grade Year? 



Reading 

The second reading mark shows k3% of the children falling in the 
lower two categories, getting 3's and U's. About the same number (U2^) 
were in the lowest category (U) for the first reading mark. The 
remainder at the second time point split about 1:3 between I's and 
2's respectively (Table 6,l6). About 30% had received 2's earlier, none 
received I's. The marking scale has thus been altered at both ends 
with fewer children at the end of the year getting the lowest possible 
grade and a modest number (15^) receiving the highest possible grade. 

By the end of the year the majority of children {57%) receive the 
two highest marks because so many 2's are given (^2^). There is a 
significantly greater than chance degree of consistency in marks over 
the year (39^; z = 2.38, p < .05). Despite this greater- than- chance 
matching of midyear and year end reading marks the majority of marks (61^^ 
do not match at both times. All these mismatches have the year end mark 
exceeding the midyear mark which is not too surprising considering the 
pr3nounced skew toward lower marks that was present at midyear. (The 
asymmetry of matches is significant beyond the p < .001 level with 
x| ^ U2.0.) 

Arithmetic 

Teachers give higher marks in arithmetic as the year progresses. 
Whereas 26f^ of the children received I's and 2's on first marking, about 
kO% received I's and 2's later in the year (Table 6.17). About the same 
fraction of children (26^, vs. 27^^) get the lowest grade both times, and 
the bulk of these cases are provided by children who received the same 
mark on both occasions (19?^). The overall improvement in arithmetic 
marks is significant- -I6 move down while hO move up (X^ = 10.29, p < .01). 

1 
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Table 6.16 
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Table 6.17 
Arithmetic Mark, Time 2 
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The strongest tendency, nevertheless, is for marks to remain the same. 
For of children marks remain the same. The degree of consistency is 
significantly greater than that expected by chance (30^6) (z = 7.22, p < .01). 



Conduct 

Here as well there is considerable consistency in conduct marks 
over the year. About 30^ of the children receive the same grade on both 
occasions with 37^ expected to do so by chance (Table 6.l8). There is, 
then, a significant consistency in conduct marks (z - 3.10, p < .01) 
from the middle to the end of the year. Of those children whose marks 
do shift, the shift is significantly in a positive direction (U3 ii^prove, 
18 decline, X* = 10.24, p < .01) resulting in a general increase in 
marks over th4 year. Given that the marks were reasonably high at 
midyear, this results In the marks being skewed toward the high end at 
the end of the year. 

In summary , then, higher marks are given at year end than at midyear 
In all areas. This results in a significant asymmetry in the tables 
presented. Or, equlvalently, for the majority of the cases in which marks 
do not match at the two times, the mark received at year end is higher 
than that received at midyear. Ttie general increase in marks reduces 
the skewness towards low marks in reading and arithmetic that was present 
at midyear and produces a skewness toward high marks in conduct at year 
end. 

There Is significantly greater than chance matching of marks at the 
two times in all three areas, despite the fact that approximately half 
the cases are mismatched on marks at the two times. 



How Expectations Compare In Different Areas 



At midyear there Is significantly less agreement between children's 
expectations for reading and arithmetic (28^t) than would be predicted by 
chance (37^) (z = 2.29, P < .05). This stems mainly from the different 
percentages of I's and 2's expected In these two subjects (see margins 
of Table 6.19) for 799^ of children expect a "1" in reading while 609t 
expect a "2" in arithmetic. The reader should note that there Is 
significantly less than chance matching between expectations for 
reading and arithmetic, but significantly more than chance matching 
between marks in the two areas. 

There Is too little variance in children's expectations for conduct 
at the time of the first report card (93^ expect I's) confidently 
examine the agreement between conduct expectations and expectations in 
the substantive areas. At the end of the year the agreement between 
expectations for reading and for conduct, or between those for arithmetic 
and conduct, must be interpreted cautiously because of the lack of 
variance in conduct expectations. Eighty per cent expect "I's" in 



Conduct 
Mark, 
Time 1 



Table 6.I8 

Conduct Mark, Time 2 

123 Total Percent 

1 22 6 28 23.1 

2 33 31 12 76 62.8 

3 10 7 17 1^.0 

Total 55 ^7 19 121 

Percent 45. 5 38.8 15.7 100.0 
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Table 6. 19 
Child Arithmetic E^ectatlsn, Time 1 
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conduct, and expect "2's". In both cases the number of matches does 
not differ significantly from the number e:gpected by chance. 



What Is Impact of First Mark on E35)ectations Later? 



Reading 

Children's expectations at the end of the year are still exceedingly 
high"79?6 continue to look for the highest grade, and a very small 
percentage (10^) hope for the mark received earlier. This amount of 
agreement between expectations and earlier maiks is well within the 
range of chance expectation (8^^). Over 90^t still look for I's and 2's 
despite their teachers' having given no I's and given only 30^ 2's 
earlier. A very large proportion of those who received 3's and U's still 
look for I's and 2's (U2 out of U?, close to S^), The present analysis 
shows that feedback in the form of marks has little impact, for as already 
pointed out, expectations over the year are remarkably similar. 



Arithmetic 

Receiving a low mark in arithmetic earlier in the year (almost 
1% received a 3 or U) did not dainpen expectations much at the end of the 
year. Just as was true for reading, the agreement of later expectations 
with earlier marks is at chance level {2% observed and 19^ expected) . 
A sizeable fraction of those who received 3's and U's (over UlJ) continued 
to hope for a "1" mark at the end of the year. In fact a greater 
proportion {k2$) expected a "1" at year end than had expected a "1" at 
midyear Relatively few children (about 7^6) have expectations lower 

than the mark they received in arithmetic. More (about 7%) expect a 
higher grade than they received earlier. This degree f asymmetry around 
the main diagonal is highly significant (X = 60.8U, p < .001). 



Conduct 

Again, marks in conduct have a heavily skewed marginal with little 
variance to be explained. A sizeable fraction (8o^t) of children expect 
a "1" in conduct, a moderate reduction from the ^% who held this 
expectation earlier. It is interesting to note that this is the largest 
reduction in the percent of children expecting I's in any of the three 
areas and that this is the only area in which the teachers' mark 
distribution was not skewed toward low marks.' Matching of midyear mark 
with year end expectations (33^) is again not statistically different 
from the matches expected by chance (30^&). Asymmetry around the main 
diagonal remains significant (xj = 50.58, p < .001). 

In summary , the first marks the children received seem to have had 
little effect on their expectatlonF ! As noted previously the fact that 
the children's expectations are markedly shewed toward high expectations, 
while the teachers' mark distributions are not, mean^ that the large 
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Table 6.20 
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Table 6.21 
Arithmetic Expectation* Time 2 
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Conduct Esgpectatlon, Tine 2 

1 2 3 U Total Percent 

1 21 U 1 26 23.2 

Conduct 

Mark, 2 56 15 1 1 73 65.2 



Time 1 



3 13 13 11.6 

Total 90 19 2 1 112 
Percent 80.U 17.O 1.8 0.9 100.0 
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caiority of children receive negative feedback In the form of marks at 
midyear being lower than midyear expectations* A minority receive 
neutral feedback (mark equals e^cpectatlon) and a rare few receive 
positive feedback (mark exceeds expectation). Ea^ectatlons remain 
skewed toward high es^ectatlons despite the largely negative feedback. 

The lack of any significant deviation from chance matching In any 
of the areas suggests that the midyear mark holds no special appeal for 
the children. There Is no special (beyond chance) tendency for children 
to "adopt" their midyear mark 6.nd turn It Into a year end expectation . A 
more detailed exGunlnatlon of feedback effects Is presented later. 



Does Children's E39ectatlon-Mark Agreement 
Vary with Year End Mark Received? 



The type of analysis carried out in the corresponding section of 
Chapter 3 cannot be carried out here. Hhe combination of exceedingly 
high child expectations with relatively low mark assignments restricts 
the possible patterns of outcomes so severely that it is in general 
meaningless to attempt to assess whether children of different abilities 
differentially incorporate their actual performance (as viewed by the 
teacher) in the formation of their expectations. The extreme discrepances 
between children's expectations and marks suggests there is generally 
little incorporation of actual performance in the formulation of 
expectation, which leaves little hope of observing substantial numbers 
of reality- testing children at any particular ability level. 

In the case most amenable to analysis— that of arithmetic at year 
end— there is no hint of reality testing for children of any ability level. 
The expected pattern based on a "marginal probability" model is as follows. 



Hypothetical Pattern Based on a Marginal Probability Model 

(Arithmetic at Year End) 
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100 



The observed pattern approximates this very closely, indicating the 
children are not incorporating their performance (As viewed by the teacher) 
in the formulation of their expectations. The largest discrepance that 
occurs is for the high ability students (mark = 1), but this discrepance 
is in a direction opposite to that would be predicted on the basis of a 
reality testing model. 
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Table 6.23 



Arithmetic 
Mark, 
Time 2 



3 
k 



Expectation Minus Mark, Arithmetic, Time 2 
-3 



U6% 
13 



Total 13 



-2 


-1 


0 


+1 


-1-2 


+3 Total 






33% 
k 


50% 
o 


173t 


12 




16 


15 


11^ 




35 


hoi 

16 


3^ 
15 


15% 
6 


7% 

3 






36% 
10 


k 


1 






28 


26 


35 


26 


13 


2 


115 



100% 
lOQi 
100% 
100% 



Arithmetic 2 
Expectation, 
Time 2 

3 



Total 



Table 6,2k 

Expectation Minus Mark, Arithmetic, Time 2 



-3 


-2 


-1 


0 


+1 


-1-2 


■t-3 Total 


27% 
13 


33% 
16 


33% 
16 


Bi 
k 










22% 
10 


33% 
15 


33% 
15 


13% 
6 




lf6 






25% 
k 


38% 
6 

25% 
1 


25% 
if 

75% 
3 


13% 
2 


16 


13 


26 


35 


26 


13 


2 


115 



100% 
100% 
100% 
100% 
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Does Children's Expectation-Mark Agreement 
Vary with Expectation Level? 



The question being asked here is: "Do children of differing 
expectations levels differ in their ability to form realistic 
expectations?" Are those who hold moderate expectations, for example, 
also those who form realistic expectations? On the surface it might 
appear that those who expect I's are the most unrealistic given the low 
mark distribution. This is actually a misleading argument, however, 
since for tables like Table 6.2U, realism would be indicated by a shift 
in the distribution of the discrepance between expectation : and marks 
toward the "0" discrepance column. (This model is similar to that 
discussed in the preceeding section.) 

The only case that is not too severely restricted (by the skewed 
mark and expectation marginals) to preclude analysis is again that of 
arithmetic at year end. 

The marginal mark distribution for arithmetic is I's = 10^, 2's » 
30%, 3's = 3^, and U's = 2h%, If the mark distributions within each of 
the expectation levels are similar to one another and the marginal mark 
distribution (i.e., no expectation level displays realism) one would 
observe the following hypothetical pattern. 



Expectation 



Table 6.2Ua 
Hypothetical "Non- Realism" Pattern 
Arithmetic Time 2 

Expectation Minus Mark Discrepance 





-3 -2 


-1 


0 


+1 


+2 


+3 




1 


2h 35 


30 


10 








100 


2 


2k 


35 


30 


10 






100 


3 




2k 


35 


30 


10 




100 


U 






2k 


35 


30 


10 


100 



A shift toward the column headed "0" discrepance for any row would 
indicate the formation of realistic expectations for the level of 
expectations concerned. 

The actual data, presented in table 6.2h, indicate no such shifting. 
(Row k is not considered due to the small number of cases present.) 
The large majority or cells are within 3 percentage points of the expected 
values, the largest discrepance being 5 percentage points (row = 3, 
column ="+l). The fit is surprisingly good considering the N's Involved. 

Our conclusion for arithmetic at year end must be that we have no 
evidence to suggest that any particupar expectation levels foster the 
formation of realistic expectations . 
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The Effects of Feedback on Marks 



Reading 

As noted earlier, there Is an Increase In the average reading mark 
from midyear to end of year (3.15 (N « 72) vs. 2.59 (N = 127)). In fact 
as Table 6.25 Indicates, no child received a J.cwer mark at the end of the 
year than he had at midyear. If, as could be the case, these initial 
reading marks are entirely unlnterpretable by the child, one would expect 
no relation between ii^provement in marks and the midyear discrepance 
between expectation and mark. For the 57 cases where data are available 
this hypothesis is not contradicted as shown by a test on the collapsed 
table (X = 0.5U). The trend for the discrepant cases, however, in the 
non-collapsed table is suggestive. For 16 children with an earlier 
discrepance of 1, half ing)rove (8) and half (8) stay the same; for I6 
children with an earlier discrepance of 2, 63% inqarove; for I9 children 
with an earlier discrepance of 3, 7^ improve. Though not statistically 
significant, a consistent trend such as this warrants attention. 

Arithmetic 

The change in marks over the first grade year can likewise be seen 
best in a collapsed table (Table 6.26). Again analysis is hampered because 
almost no one did better than he e3g>ected at midyear. Here however, of 
those— the large majority— who did worse than they expected, a large 
fraction (39^) get better marks and only 9^ get lower marks. This is in 
contrast to what happens to children whose marks earlier were exactly what 
they expected. In this group (N = 12) get lower marks and 17^ show an 
increase. The numbers Involved are small but an exact test used on the 
extreme cells shows that of those whose marks change up or down, change 
is significantly related to the direction of the midyear discrepance 
(p < .01, exact test) (Table 6.27). Almost all those whose marks irnprove, 
earlier got less than they hoped for, while of those whose marks equalled 
what they expected, a larger number got lower marks than higher at year's 
end. 



Conduct 

Over one- third of the conduct marks rise over the year. As was true 
in the other areas, no child received a mark higher than he expected at 
midyear. Looking only at children whose marks change, one sees that there 
is a significant tendency for those whose marks iniprove to be recruited 
from among students whose marks were less than they hoped for at midyear 
(p < .056, exact test). 



Effects of Feedback on E^ectatlons 

Reading 

If one looks closely at what happens to children's expectations at 
the end of the first grade year in reading in relation to the discrepancy 
earlier between expectation for the first mark and the first mark received. 
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Table 6.23 (Collapsed from Table 6-A-l) 



Year End Reading 
Mark is B.S 



Better 
Same 



Expectation-Mark 
Discrepance Earlier 



0-1 

12 
10 

22 



2* 
2k 
11 

35 



36 
21 

57 



Table 6.26 (Collapsed from Table 6-A-2) 

Arithmetic Marks go 



Child Earlier 

Did B.S,W 

Than He Expected 





up 


Same 


Down 




Better 


0 


1 


0 


1 


Same 


2 


5 


5 


12 


Worse 


35 


U6 


8 


89 




37 


52 


13 


102 



Table 6.27 (Collapsed from Table 6.26) 







Arithmetic Marks go 
yp Dovm 




Child Earlier 

Did B.S.W 

Than He Expected 


Better 
Worse 


2 

35 


5 
8 


7 

U3 






37 


13 


5C 
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Table 6.28 (Collapsed from Table 6-A-3) 

Conduct Marks go 



Child Earlier 

Did S.W 

Than He Expected 



Same 
Worse 



UP 


Same 


Dovn 




3 


19 


5 


27 


3h 


31 


10 


75 


37 


50 


15 


102 
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the following observations emerge. In 5 instances where the child's mark 
exactly equalled his expectations, 3 children's expectations later rose 
at the end of first grade and 2 children's eaqpectations remained the same. 
In 50 instances where the child's expectations exceeded his mark, expec- 
tations remained stable in 31 instances, rose in 8 instances and declined 
in 11 instances. Thus in only 22^ of the cases when the first mark was 
lower than the child hoped, does the child modify his hopes downward. 
Evidence disconfirming hopes does not lead to changing hopes. In rio 
instance does the child's mark exceed his e:^ectationsI The changes in 
e3Cpectations fot reading over the first grade year in terms of the midyear 
mark-expectation discrepance is best seen in a collapsed table (Table 6.29). 
Again the analysis is hindered by the lack of some midyear marks in read- 
ing. A rather small percentage (20^) registered an increase in expecta- 
tions between midyear and end of year and of these 3 out of 11 cases had 
gotten what he expected at midyear, llie others had gotter less than they 
eacpected at midyear. Almost everyone did worse than he expected, and all 
of those whose esqpectations go down are recruited from this group. Over- 
all, however, the table is not very satisfactory because of the small 
number (5) of persons falling outside the "worse" row. 



Arithmetic 

Ihe same kind of analysis for arithmetic expectations yields the 
following. There are 11 instances where earlier mark and earlier expecta- 
tion exactly coincide. Of these, k children do not change their ea^pecta- 
tion and 3 hope for higher marks now. The remainder (2) are less 
optimistic than they were earlier. There are 6k Instances where the 
earlier mark did not come up to the earlier es^ectation. Of these 36 {k3%) 
maintain expectations at the same level and 22 (26^) increase . The 
remainder {31%) decrease. Only l8?t however look for C's and D's although 
earlier the teachers have given 72^ C's and D's. 

Arithmetic e^gpectations are more volatile than reading e3Q>ectations— 
about 29^ go up during the year con5)ared to 20^, and 29^fc go down during 
the year compared to 20^t. The greater upward movement is quite under stemd- 
able because the midyear expectations for arithmetic were nuxch more modest 
than reading expectations (1.7^ vs. 1.28). The greater downward movement 
of the discrepant cases may be attributable to the children having a 
clearer understanding of their low marks in arithmetic con^pared to reading 
where the marking system was somewhat confusing at midyear. 

Still, however, since only one child earlier did better than he 
expected, conclusions are limited. Looking only at the 55 children whose 
expectations changed (h cells) (neglecting the single positive discrepance) 
one sees some indication for an earliez negative discrepance to lead to 
decreases in expectations. Numbers however are small and an exact test 
shows the relationship to be non- significant. The most likely event is 
for expectations to remain stable. 

Conduct 



Expectations in conduct tend to decrease slightly over the year (1.07 
to 1.<''3). These expectations how more realism. Of 25 whose earlier 



Table 6.29 (Collapsed from Table 6-A-4) 

Reading Expectations Go 







Up 


Same 


Do»ni 




Child Earlier 


Better 


0 


0 


0 


0 


Did B.S«W 


Same 


3 


2 


0 


5 


Hum He E:Q)ected 


Worse 


8 


31 


u 


50 






11 


33 


11 


55 



Table 6.30 (Collapsed from Table 6-A-3) 

Arithmetic Expectations CrO 



Child Earlier 

Did B.S,W 

Than He Expected 





VP 


Same 


Down 




Better 


1 


0 


0 


1 


S€une 


5 


k 


2 


11 


Worse 


22 


36 


26 


8U 




28 


ko 


28 


96 



Table 6.31 (Collapsed from Table 6-A-6) 



Child Earlier 

Did S,W 

Than He Esqpected 



Same 
Worse 



Conduct E^ectatlons Go 
Up Same Down 
3 17 5 
1 57 13 

U 7k 18 



25 
71 

96 
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expectations matched the mark received, most (6856) expect the same mark 
again, 12% expect a higher grade, but 20% expect a lower mark. In 71 
instnaces where the expectation exceeded the mark, most maintain the 
expectation (57). All of the rese, with one exception, lower their hopes. 
Again, however, in no case did the mark exceed the expectation. When 
e:q?ectation-changes are viewed in terms of the midyear discrepeuice between 
mark and expectation, the collapsed table (Table 6.31) summarizes the 
findings. For those whose e:?pectations change , only 23% of the group, 
there is a tendency for expectations to be lowered when the previous mark 
was worse than expected. 

In the cases of reading and arithmetic, eJ^pectations were more 
resistant to "bad news", for those changing moved about equally in the 
two directions. In conduct with 13 out of Ik decreasing, the trend is 
significantly downward (z = 2.9U, p < .01) tested against a null 
hypothesis of changes up or down for those with negative discrepances. 



Summary of Feedback Effects 



For Marks 

In all three sets of marks, tho^e who improve tend to be those whose 
marks earlier were less than they expected. But those whose marks decline 
also tend to be those whose marks earlier were less than they e:gpected 
simply because most of the children received marks lower than they 
expected at midyear. 

However, in arithmetic and conduct significantly more students 
tended to show an improvement in their mark (as opposed to showing a 
decline) when there was an earlier discrepanfee between mark and expectations , 
compared to the improvements and declines that followed no discrepance. 
Too few cases with no discrepance are available in reading to place any 
confidence in the values observed there. 



For Expectations 

For the cases where e^qpectations changed, in all three areas 
expectations tended to decline (non-slgnliiv^antly) more if their was an 
Initial discrepance (over- estimation) than if there was no initial dls- 
crepance . 

For the children whose marks and expectations do not coincide at 
midj'ear there is a tendency for the changes in marks and expectations that 
do occur to be changes in a manner that decreases the original discrepance . 
For the children with no mark-expectation discrepance at midyear, no con- 
sistent pattern of change emerges for either marks or expectations. 
Varying) but sizeable, proportions of cases retain the same mark or 
e:g)ectation at year end as at midyear no matter what the relationship 
between the mark and the child's expectation at midyear . 
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Do Marks and Expectations Show a Disproportionate Likelihood of Slnailta^ 
neously Adjusting to Counteraot a Mark Esqpectatlon Discrepance ? 



It was previously shown that both marks and expectations show slight 
tendencies to move over the course of the first year In a direction to 
reduce discrepances between marks and e:s^ectatlons at midyear. These 
observations could arise In several ways. First, a child could change 
both his mark and his expectation In a direction to reduce midyear 
discrepance. Second, a child could change either e^ectatlons or marks. 
That Is, some children might maintain their Initial expectations and 
effect a mark change, while other children might maintain their original 
mark and adjust their e^ectations. 

Since all the discrepances reported in Table 6.32 are of the type 
where the child's expectation exceeds his mark, the column headed 
**Mark Up--E:Q)ectatlon Down" represents the first type of child described 
above. Hhe second type of children- -those who change either marks or 
esgpectatlons to reduce a discrepance— appear In the column headed *^rk 
Up— Expf^'^tatlon Same** (mark changes) and In the column headed **E^ecta- 
tlon Down—Mark Same" (expectation changes). Comparing Just these three 
columns (and neglecting all rows indicating no discrepance) one sees 
that results differ. Reading and conduct are rather similar in that the 
majority of the discrepance- reducing changes are of the mark-up-only type 
with the remainder of the cases about evenly split between the expectation- 
down-only and the mark-up-es^ectatlon-down types. The number of cases 
Involved in these two patterns are 19^ ^9 and 1 for reading and 29, U, and 
k for conduct. Arithmetic has approximately equal nuinbers of cases for 
all three types of patterns {13y 10, and 13 respectively). 

In all three areas the simple change types (mark or e^ectatlon 
change) together are more numerous than the double change types (mark and 
expectation chsi^nge). The safest general conclusion, given the small 
numbers involved and differences between areas, is that discrepance 
reduction occurs in various ways and data so far collected are not 
extensive enough to permit definitive findings. 

As a final notG» the reader should be cautioned that the preponderance 
of cases appearing in the coluzcns headed "Mark Up" compared to the columns 
headed "Mark Down" does not neces^mrily imply the necessity of "adjusting 
for leniency of year end marking". If causal efficacy is granted to 
expectations over the course of the year, the skewed expectation distri- 
bution would also explain some (and possibly a large) part of the prepon- 
derance of increasing marks over declining marks. 



Why does Feedback Have Little Effect on Expectations ? 

What may be happening to account for the "lack of reality testing" 
in reading? Low marks had almost no effect on children's expectations. 

Perhaps the children do not know how they have been evaluated. Hie 
marks for reading as recorded on report cards are actually almost 
inscrutable. The authors were able to interpret these marks only after 
consultation with teachers and the principal. As mentioned earlier, three 
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Table 6.32 

Mark-Expectation Discrepance at Time 1 by Mark and Expectation Changes 
Between Times 1 and 2: Reading, Arithmetic, and Conduct 



Reading 



3 

Mark 
Minus 
Expectation, 1 
Reading ^ 



Mark Up 

Expectations 
Up Same Down 

10 3 

1 6 3 
U 3 1 

2 1 



Mark Same 

Expectations 
Up Same Down 

2 3 

1 5 

2 5 1 
1 1 



Mark Down 

Expectations 
Up Same Down 



Arithmetic 



Mark ^ 

Minus ^ 

Expectation, 1 
Arithmetic 



Mark Up 

Expectations 
Up Same Down 

1 1 

2 5 7 

5 7 5 

1 1 



Mark Same 

Esgpectations 
Up Same Down 

k 2 

2 7 3 

9 11 5 
2 2 



Mark Down 

E3^ctation& 
Up Scuae Down 



k 
3 



1 
1 



3 
1 



Conduct 



Mark Up 



Mark Same 



Mark Down 



3 

Mark « 
Minus *^ 
Expectation, 1 
Conduct 



E^qpectations 
Up Same Down 



1 
2 



5 

2k 



Expectations 
Up Same Down 



k 

19 
11* 



k 
3 



E^ectations 
Up Same Down 



5 
3 



5 

2 



* One case where a student's mark exceeaea nis expectations in 
omitted from this portion of the table. 



o 

ERIC 



6-36 



Separate schemes were used for marking which reflected progress in a 
given set or reading materials. None of the schemes coincides with the 
A, B, C, or 1, 2, 3 codes for other subjects on the report card. The 
yiptar-unitp scheme reports the number of units completed. Only a person 
well versed in these instructional materials and in the general progress 
of first graders in reading could readily interpret this scale. . e 
other two scales are literal— P, PP, etc.— to indicate the book the child 
is currently reading. Again, only a practiced reading teacher familiar 
with different genre of introductory reading texts coulo evaluate the 
child's progress. To conqplicate things further, the two series using 
P, PP, etc. use the same letters (e.g., P) to stand for two different 
levels of progress. A "Primer" level in one series is not equivalent 
to a "Primer" level in the other. 

What children are receiving as feedback may therefore depend very 
little on recorded marks in reading. They may be relying almost 
exclusively on informal evaluations of the teacher given during classroom 
recitations. The Distar Program, interestingly enough, emphasizes lavish 
use of positive reinforcement: the teacher congratulating the child by 
shaking his hand, giving the child raisins or other small pieces of food, 
pinning signs on the child such as "I did well today", as well as the 
more usual types of reinforcement like a steady flow of verbal praise 
and smiles. This exaggerated kind of informal positive feedback may kindle 
the very high expectations the children continue to register at the end 
of the year and swamp any negative feedback from report cards, especially 
since the report cards are essentially uninterpretable to children and 
parents. 

Encouragement and positive feedback are certainly necessary to 
motivate children in day-to-day classroom efforts, but one wonders what 
effect the extreme dissonance between the teacher's classroom behavior 
and her forced use of an absolute level grade scale may eventually have. 
What happens to the child, for example, when later in the second or third 
grade he realizes what his earlier report cards actually signified Guid 
looks back upon a long series of low marks? 

While the unlnteligibility of marking schemes is a plausible explana- 
tion for the lack of a feedback effect in reading, the lack of feedback 
effects in arithmetic and conduct are more readily understood, especially 
as the year progresses. Here a plausible explanation for the lack of 
effect may lie in the informal positive feedback given by the teacher in 
daily classroom sessions outweighing the infrequent negative feedback on 
report cards. 



Children's Ability to Forecast 

How Does Discrepance at Middle of Year f )mpare with Later Discrepance in 
Mark-Expectation Relations ? 

Reading 

l^ble 6.33 displays how discrepances between expectations and marks 
behave over ti*»i first-grade year, or in other words whether children get 
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Table 6.33 (Collapsed from Table 6-A-7) 



End of Year 



Child Expected to do 
(Better, Same, Worse) 
Than He did in Reading 



Midyear 





B 


S 


B 


38 


2 


S 


8 


2 


W 


k 


1 




50 


5 



w 



10 
5 

55 



Table 6.3k (Collapsed from Table 6-A-8) 



End of Year 



Child Expected to do 
(Better, Same, Worse) 
Than He did in Arithmetic 

Midyear 



B 


S 


W 




B 56 


6 




62 


S 18 


3 


1 


22 


W 10 


2 




12 


8lv 


U 


1 


96 



Table 6.35 (Collapsed from Table 6-A-9) 



End of Year 



ERIC 



Child E3^cted to do 
(Better, Same, Worse) 
Than He did in Conduct 



Midyear 





B 


S 


B 


38 


k 


S 


28 


18 


W 


5 


3 




71 






() - 





w 



I 



U2 
U6 
8 

96 
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better at forecasting their marks. As before, no one at first marking 
g(;ts marks higher than he e3Q)ects— all get the same or lot/er. By the end 
or the year of those (50) who ejqaected to do better than they did initially, 
12— roughly one quarter— have a zero or positive discrepance, thus receiving 
a mark they expect or one better than they es^ect. The vast majority, 
however, who expected to do better earlier continue to find themselves 
receiving marks that do not live up to their e3g)ec tat ions. There i s some 
increase in the agreement between marks and escpectations (9^ got what 
they expected at midyear vs. l8^& at year end), but the strong trend 
toward over-optimism is mitigated only moderately. There is some 
undershooting at year's end, about Sff, expect to do worse than they 
actually do. These few are happily surprised. 



Arithmetic 

. A similar analysis is presented for arithmetic in Table 6.3^. This 
table has much the same pattern as the one for reading, but trends are 
clearer. First, there is considerable moderation in optimism by the end 
of the year- -35^ of children now show expectations the same or less than 
their mark compared to 13^ at midyear. Second, a majority (58^) of the 
children repeatedly expected better marks than they received. And third, 
of those who change (from agreement to disagreement, or the reverse), 8 
move away from agreement whiTs I9 move toward agreement. A X* test for 
related samples shows that this trend approaches the 3% level in signi- 
ficance (Xi - 3.70). There is, in other words, a distinct movement that 
makes expectations agree with marks even though 65^ remain over- optimistic. 

It is of interest to note that the rate of movement away from midyear 
mark- expectation agreement (8 of 11 or 73%) is considerably larger than 
the rate of movement away from midyear disagreement (I8 of 8h or 21%), 
That is, between midyear and year end students depart from the state of 
mark-expectation agreement at a faster rate than they depart from the 
state of mark-expectation disagreement. If these rates remained stable, 
more children would continually be found in a state of mark- e3q)ec tat ion 
disagreement than in the state of agreement. There is, however, reason 
to believe that at least one of these rates will change. In particular 
if the first agreements occurred by chance (or factors unrelated to 
performance) the high proportion of followup disagreements would be 
expected. As agreements come to be a function of the child's actual 
perception of his performance, however, followup agreements should 
become much more likely. That is, the rate of departure from the agreement 
state should decline. The incorporation of evidence now being collected 
will indicate whether such a shift does occur during the second year and 
indeed will also provide a larger case base on which the rates for the 
first year can be calculated. 

The reader should note that a similar pattern of a large oiigration 
away from agreement and a small migration away from disagreement occurs 
in ths case of reading. The small case base for the agreement state 
precluded discussion. 



I 



Conduct 



For conduct a suimnary is shown in Table 6.35. Again there is consid- 
erable movement toward agreement over the first-grade year with over- 
S^$i!sm decreasing from to-^i^ As before when the fend toward 
agreement is contrasted with the trend away from agreement, the trend 
tSJI^S Sgreement is confiiiaed, here at a high level of significance 
(Xj = 11.U3, P < .001). 

in conduct, unlike in reading and arithmetic, the rate ^ 
away from the state of mark- e:?pec tat ion agreement is smaller ^^.J^® f 
o^l^ovement away from the state of mark- expectation disagreement (7 
of 25 or 2% vs. 28 out of 71 or 39f>). In fact the rate of movement away 
?rom agreement is the smallest of the rates for the three areas and the 
rlTe of movement away from disagreement is the largest of the tte^^^ 
All these facts are consistent with an observation that children W »e 
miie able to oStain, comprehend, and incorporate if J""?*! on relevant to 
their conduct than to the two substantive areas. This interpi^etation is 
Sso suggested by the fact that conduct displays the lowest Percentage 
of repef?ed overlestimations (UO*) and the highest percentage of initial 
(midyear) agreements (26^t) of the three areas. 

in summary, there is evidence for arithmetic and 'JJ^^^t 
children-raccuiacy in forecasting increases significantly. (Too few 
caseflSe avaiUble to draw a conclusion for reading.) The absolute 
^^ts ofTcu^acy, however, even at the end of the year are only ^^out 

not overly impressive. Perhaps the most noticeable fact about all 
t2?;e ^eal lf tS at year end at least a few c^^i^Jjen forecast a 1^^^ 
m^rv than that received. No underestimates occur with respect to first 

Vconduft c^^^^ the least amount of repeated over- 

ZTi^tiln (wSich none the less is substantial in all areas) "well as 
thelreatest ability to maintain agreement between marks and ea^pectations. 

Parents' Expectations 

Parents are optimistic but guarded. As noted in Table 6.1 Par^ts' 
expectations are fairly stable across reading and arithmetic, 2.33 and 2.^:j 
rSSectively. They are somewhat more optimistic about conduct, I.89. 
S aJe also noticeably less variable for e:5)ectations in conduct, the 
X:ndrrd devLuon 0.35'being about half what is seen for reading or 
arithmetic. Most (kSfH) look for a B In reading, 52^ look for a B in 
arithmetic and 69f> look for a B in conduct. 

Parents' E x pectations vs. children 's Expectations 

As already noted in several places the children's expectations for 
readin ~duct are extremely high Because of this the eongn^ence 
exoected between parents and children in low (19^). The levei °; 
coSgr^ence actSliy observed {2%) is also low, not in fact significantly 
greater than would be predicted by chance. 
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Table 6.36 

Child's Reading Eacpectation, Time 1 
^23^ Total 



Percent 





1 


9 


1 


1 




U 


12.1* 


Parent's 


2 


32 


11 




1 


uu 


1*9.^ 


Reading 
Esqpectation, 


3 


22 


3 


2 




27 


30.3 


Time 1 


k 


7 








7 


7.9 




Total 


70 


15 


3 


1 


89 






Percent 


78.7 


l6.9 


3.U 


1.1 




100.0 



Table 6.37 



Parent's 
Arithmetic 
Expectation } 
Time 1 



Child's Arithmetic Expectation, Time 1 

3 Total Percent 





1 


2 


1 


7 


7 


2 


20 


26 


3 


5 


21 


U 




2 


Total 


32 


56 


Percent 


35.6 


62.2 



1 

« 

2 

2.2 



ll* 

hi 

27 
2 

90 



15.6 
52.2 
30.0 
2.2 

100.0 
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Table 6.38 



Child's Conduct E3g9ectatlon, Time 1 







I 


2 


Total 


Percent 




1 


19 




19 


21.1 


Parent ' s 


2 


56 


6 


62 


68.9 


Conduct 










Esipectation, 


3 


7 


1 


8 


8.9 


Time 1 












k 


1 




1 


1.1 




Total 


83 


7 


90 






Percent 


92.2 


7.8 




100.0 
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The agreement between parents and children is higher for arithmetic-* 
3b1^-but this again does not differ from chance prediction of agreement 
(39^)> The agreement between parents and children for conduct Is 28^, 
despite the fact that 92^ of the children look for a 1. This matching 
again does not significantly depart from the chance prediction of 
agreement {2%). 

These data contradict the hypothesis that parents' expectations for 
school performance detemlne what school preformemce the children come to 
e^ect for themselves at least very early In their school career. 



Parents* Expectations vs. Marks 
Reading 

As noted earlier there Is little correspondence between marks children 
expect In reading and those they receive. Likewise there Is little 
correspondence between e^qsectatlons of parents and the marks their children 
receive on their first report card for only 27% of the parents' expecta- 
tlons are borneout (compared to 2% expected by chance). Both this low 
level of expected matching and the asymmetry (parents' e:Q)ectations 
generally exceeding marks) are a result of the low level of reading marks 
initially assigned. A rather surprising thing is that 3 parents {%) 
expect their child to get the lowest mark and in all 3 instances this 
expectation is confirmed. 

Later at the end of the first grade year the amount of agreement 
between parents' expectations and marks obtained Inreadlng goes up to ^2^. 
This of course is highly significant (z = h,k2, p < .01). It is somewhat 
surprising that parents' midyear expectations significantly match year end 
reading marks but do not significantly match midyear reading marks. The 
resolution of this anomaly is the fact that excessively low marks were 
granted in reading at midyear while a mark distribution only moderately 
lower than usual appears at year end. 

At year end significantly more (x| = '^•^O, p < ,03) of those parents 
whose expectations are non-veridical are disappointed (67^) rather than 
the reverse. A surprisingly large percentage {7%) expect their child to 
do poorly axid over half of these children do (U out of 7). 



Arithmetic 

In arithmetic one finds at midyear only about chance agreement between 
parents' forecasts of marks and the marks their children obtain (29^ match 
conpared to 2d% matches expected by chance) . Again the mark distribution 
is quite low, 7% of the children receive 3's or U's, resulting in parents' 
seemingly overestimating their children's midyear marks (X^ » U6.1, p < .01). 

At year's end the amount of agreement (32^) does not significantly 
exceed chance level (27%) . Since the mark distribution only rises slightly 
by year end (6o% now receive 3's or 4's) t^ere remains a marked asymmetry 
in the mismatched parents' esgpectations (X, = 18.23, P < .01). More 



Table 6.39 

Reading Mark, Time 1 
1 2 3 Total Percent 





1 


7 


3 


2 


12 


18.7 


Parent 




12 










Reading 


2 


11 


10 


33 


51.6 


Expectation, 
Time 1 


3 


2 


2 


12 


16 


25.0 




k 






3 


3 






Total 


21 


16 


27 


6U 






Percent 


32.8 


25.0 


U2.2 




100.0 



t 

Table 6.4o 



Reading Mark, Time 2 
12 3 4 Total Percent 





1 


5 


7 


1 


1 


llf 


13.5 


Parent 


2 


5 


30 


11 


U 


50 


U8.1 


Reading 
















Expectation, 


3 




o 


15 


9 


33 


31.7 


Time 1 


















k 




1 


2 


i» 


7 


6.7 




Total 


10 


hi 


29 


18 


lOU 






Percent 


9.6 


U5.2 


27.9 


17.3 




100.0 
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Table 6M 



Arithmetic Mark, Time 1 
12 3 4 Total 



Percent 





1 




8 


5 


k 


17 


16.3 


Parent 
Arithmetic 


2 


1 


11 


28 


11 


51 


49.0 


Expectation, 
Time 1 


3 




6 


17 


10 


33 


31.7 




k 








3 


3 


2.9 




Total 


1 


25 


50 


28 


16k 






Percent 


1.0 


2k,0 


W.l 


26.9 




100.0 



Table 6,k2 



Arithmetic Mark, Time 2 
1234 Total 



Percent 





1 


3 


6 


7 


1 


17 


16.2 


Parent 
Arithmetic 
Expectation, 
Time 1 


2 
3 


8 
2 


17 
6 


15 
12 


12 
13 


52 
33 






k 






1 


2 


3 


2.9 




Total 


13 


29 


35 


28 


105 






Percent 


12. U 


27.6 


33.3 


26.7 




100.0 
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Table 6.43 



Parent 
Conduct 
Expectation, 
Time 1 



1 
2 

3 
k 



Conduct Mark, Time 1 
12 3 Total 



9 
13 



15 
50 

5 



8 
3 



Total 23 70 11 
Percent 22.1 67.3 10.6 



2k 
71 
8 
1 

10k 



Percent 

23.1 
68.3 
7.7 
1.0 

100.0 



Table 6M 

Conduct Mark, Time 2 
12 3 Total Percent 





1 


17 


6 


1 


2k 


22.9 


P&rent 
Conduct 
Expectation, 
Time 1 


2 
3 


31 
1 


28 

k 


11 
5 


70 
10 


66.7 
9.5 




k 


1 






1 


1.0 




Total 


50 


38 


17 


105 






Percent 


lf7.6 


36.2 


16.2 




100.0 
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children (31^) do worse than expected if the parents* forecast is incorrect, 
and only 17% do better. Parents are lookiig for more B's in arithmetic 
than teachers award vs. 28^) and they look for very few failures but 
a large number are recorded (3^^ vs. 60^). 



Conduct 

Parents* e3<pectations for conduct match the marks given in terms of 
marginal distributions— parents es^ect the teachers to behave as they do 
in aseigninf marks here. There is in adr^ition a better-than-chance 
ability of a parent to forecast his child* s conduct mark, with of the 
predictions being accurate. This accuracy exceeds chance (3^) at the % 
level of significance (z = 1.97). Also there is roughly an even division 
among parents who are over- optimistic and over-pessimistic in this area. 

At the end of first grade ther 1-s again significant matching between 
parents' expectations and children's marks in conduct (37^ matching 
expected and k% is observed; z - 2.62, p < .01). Here, however, despite 
this agreement parents' display under-optlmlsm. Significantly mor| (33^) 
children exceed their parents' hopes than disappoint them (17^) (X^ = 3.89, 
P < .05). 

In sum while parents eaqaectations consistently display above chance 
notching with marks in conduct and consistently display only chance levels 
of matching with marks in arithmetic , reading displays signlficemtly 
above chance matching only at year end . All the cases of non- significant 
matchings are accompamied by discrepant mark and expectation marginal 
distributions— the discrepances being quite radical in the case of midyear 
reading and arithmetic. Hhe marginal discrepances (low marks vs. moderate 
expectations) result in parents esqpectlng higher marks than their children 
receive in reading and arithmetic at both midyear and year end. In conduct 
no symmetry is present at midyear while conservative asymmetry (lower 
expectations than marks) appears at year end. 



Sex Differences 



Midyear Marks 

At first marking the assignment of marks in reading does not show 
association with sex. In arithmetic there is likewise no significant 
association between marks and sex, although a few more of the high marks 
go to girls and a few more of the low marks go to boys. In conduct there 
is a strong association (X^ = V.2U, p < .01) between high marks and 
femaleness when the split between the sexes is conqsared for A's vs. C's 
only. Over two thirds of all the A's go to girls and more than two- thirds 
of the C's go to boys. While the extreme marks display a highly 
significant association with sex the overall mark distribution is only 
moderately significant (X^ = 7.25, p < .05) due to the similarity of the 
bulk of the cases which receive an average mark. 
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Table 6M 



Sex 



Conduct Mark, Tine 1 





1 


2 


3 


Total 


Percent 


Male 


9 


39 


Ik 


62 


U8.1 


Female 


20 


hi 


6 


67 


51.9 


Total 


29 


80 


20 


129 




Percent 


22.5 


62.0 


15.5 




100.0 
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Midyear Expectations 



In terms of children's e:Q)ectatlon8 in reading^ a few boys (about 10^) 
expect to do poorly, whereas only one girl (2^) does— otherwise the 
distributions are very similar. In arithmetic and conduct the escpectations 
for the two sexes look almost identical. 



Midyear Marks by Expectations 

The agreement between expectations for reeullng and marks in reading 
is the same for the two sexes— agreement levels are almost Identical with 
chance e^^ectancy in both sexes. Also the tendency to over-optimism is 
strongly present In both sexes .and present to Just about the sane degree. 

In arithmetic e^gpectatiohs, over-optimism appears but Is less 
pronounced than for reading. There is, furthermore, no teiAdency toward 
significant matching between expectations and marks in arithmetic for boys 
or girls. The tables for the sexes are very similar. 

In conduct, as already noted, more boys than girls get the lowest 
grade, but the variance in expectations is so minimal that it is not 
possible for sex differences to appear. 

In sum, boys and girls are virtually identical in the relationships 
displayed between midyear marks and e^gpectations for all three areas. 



Year Knd Marks 

At second marking the assignment of marks in reading shows a tendency 
toward an association of higher marks with females. While this association 
does not attain conventional significance levels (Xj - 3*2^, N.S.) it is 
interesting that the assignment of the very highest and very lowest marks 
is almost exactly equivalent by sex of child, while the intermediate meurks, 
2's and 3's, are the areas in which the association is tending to appear. 

At year end the assignment of marks in arithmetic favors girls 
(average girl's mark is 2.72 compared to boy's average of 2.83), but the 
difference is very slight. 

A noticeable and lavge difference by sex occurs for marks in conduct. 
The highly significant association (Xj = 18.17, P < .01) between sex and 
conduct marks in the e:q>ected direction — girls disproportionately receiving 
the high marks, boys the low. Girls receive two thirds of the high marks 
and boys receive 06$ of the low marks. 



Year End Expectations 



In terms of reading expectations at the end of grado one, the distri- 
butions for the two sexes are virtually identical. For arithmetic 
expectations, however, in contrast to midyear when ohe two sexes were the 
same, there is now a tendency for girls to look for the higher grades and 
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Table 6.U6 



Reading Mark* Time 2 



Sex 



Sex 



Sex 





1 


2 

mm 




li 






Male 


6 


22 


2k 


u 


63 


1^9.6 


Female 


5 


31 


16 


12 


6i» 


50.it 


Total 


11 


53 


UO 


23 


127 




Percent 


8.7 


Ul.7 


31.5 


18.1 




100.0 






Table 6.U7 










Arithmetic Mark, 
12 3 


Time 2 
k 


Total 


Percent 


Male 


7 


17 


19 


20 


63 


49.6 


Female 


6 


21 


22 


15 




50. U 


Total 


13 


38 


kl 


35 


127 




Percent 


10.2 


29.9 


32.3 


27.6 




100.0 






Table 6M 










1 


Conduct Mark) Time 2 
2 3 Total 


Percent 


Male 


18 


27 


18 


63 


li9.6 




Female 


38 


23 


3 


6U 


50.4 




Total 


56 


50 


21 


127 






Percent 




39.^ 


16.5 




100.0 
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boys to look for tlie lower. This association is nonsignificant (X3 « 3*19$ 
N.S.), however. No sex differences in conduct expectations appear at the 
and of the year» and the midyear pattern Is thus maintained. 

In sum , for sex differences so far, the sexes seem almost identical 
in terms of both midyear and year end marks and expectations for reading 
and arithmetic . In conduct , while the expectations are the same, girls 
tend to get higher marks than boys at both midyear and year end . 

At midyear boys and girls seem to display nearly identical relations 
between their marks and expectation— both being at about the chance level 
of matching and asymmetric in that expectations generally exceed marks. 



Sex Differences in Parents' E^ectatif^ns, 
and Congruence of Expectations Between Pareni&s and Children 



Reading 

Parents expect boys and girls to do about the same in reading. They 
expect boys to average 2.37 and girls to average 2.30. The sex distribu- 
tions do not differ much within mark categories, and the difference Is 
nonsignificant . 

In terms of how parents' expectations agree with children's expecta- 
tions within sex groups, there is almost exactly chance agreement for 
boys but significantly better than chance agreement for girls (z - 2.27, 
p < .05). While girls' and parents' expectations coincide to a signifi- 
cantly greater than chance degree, and boys' and parents' expectations do 
not, the absolute level of matching remains low In both cases (27^ for 
girls, 22^ for boys). In sOl cases for girls when parent and child 
disagree, girls expect to do better. Indeed, there is more skewing of 
girls' expectations, for none expects less than a B, whereas 10^ of boys 
expect a C or D. Boys, if they differ from their parents, are proi|e to 
have higher e3q>ectations for reading on their first report card (Xi - 19*33 
p < .01) . 



Arithmetic 

For arithmetic parents' expectations are almost identical for the two 
sexes, with a slight shading (2.17 vs. 2.28) in favor of boys. The 
average e3q>ectations are close to those in reading for girls, but are 
slightly higher for boys. 

In terms of hew parents' and children's expectations agree, again for 
boys there is no significant concordance (39^ observed and hl% expected). 
As with reading, in cases of disagreement between parent and child there 
is a much stronger -^encency for a boy's expectation to exceed his parents' 
than the reverse (X^ = 5.76, p < .05). 
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Table 6.^^ 
Parent's Rttadlng iii^^ectation, Time 1 





1 


2 


3 


k 


Total 


Percent 


Male 


9 


20 


18 


5 


52 


^9.5 


Female 


5 


30 


15 


3 


53 


50.5 


Total 


Ik 


50 


33 


8 


105 




Percent 


13.3 




31.^ 


7.6 




100.0 



Table 6.50 



Sex 



Male 
Female 

Total 
Percent 



Parent's Arithmetic Ea^pectation, Time 1 



1 2 

9 P.6 
8 26 



3 
16 
17 



k 
1 
3 



17 52 33 h 
16.0 k9.1 31.1 3.8 



Total Percent 
52 U9.I 
5^ 50.9 



106 



100.0 



Table 6. 51 
Parent's Conduct Expectation, Time 1 





1 


2 


3 


k 


Total 


Percent 


Male 


11 


35 


6 




52 


U9.I 


Female 


13 


36 


k 


1 


5U 


50.9 


Total 


2l| 


71 


10 


1 


106 




Percent 


22.6 


67.0 


9.h 


0.9 




100.0 
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For girls the concordance with parents for arithmetic marks is at 
chance levels (37^ observed vs. 36^ expected). There is thus no significant 
above chance agreement between parents and girls as there was for reading. 
The tendency for girls' hopes to exceed parents' is seen again here ev^n 
more strongly --of the 31 unmatched cases 9^% exceed parents' hopes (Xi « 
21.81, p < .01). 



Conduct 

Surprisingly, parents expect about the same marks for boys as for girls 
in conduct (I.90 for boys and 1.8? for girls). Because children's 
expectations in conduct are so markedly skewed, it is pointless to look 
at concordance with parents or to test the significsAce of the narked 
asymmetry. 



Sex Differences in Mark Received vs. Parents Expectations 



As noted earlier there is not a significant degree of congruence 
between parents' expectations and the marks their children actually 
receive in reading. This lack of agreement is noted for both sexes. 
Parents' forecasts are at chance levels for boys ^id girls (26^ correct 
for boys, 27% correct for girls). 

For arithmetic parents' expectations and children's marks also 
agree no more than would be expected by chance emd there is no differential 
accuracy of prediction by sex {2h% observed vs. 23% expected for boys, and 
3% observed vs. 30^ e^cpected for girls). Asymmetry appears in the tables 
comparing parents' reading and arithmetic expectations with marks (expec- 
tations generally exceeding marks) due to the low marks assigned in these 
areas at the middle of first grade. 

In conduct a picture emerges indicating that parents' predictions 
for girls are more accurate than for boys. The agreement between parents' 
forecasts for boys' conduct marks and the marks received is at a level 
attributable to chance {32% observed vs. 53^ expected). For girls, however, 
the agreement {67%) significantly exceeds chance (^1^) with z - 2.7U, p < .01 

To sum up the last two sections: Parents expect boys cuid girls to 
do the same in all three areas. Only for reading do parents' expectations 
and girls' expectations for themselves agree more often than would be 
expected by chance. Parents and boys' expectations do not coincide in 
any area. In terms of agreement with marks received, parents' forecasts 
are at chance level for both sexes in both reading and arithmetic. Parents 
can forecast girls' conduct marks with better than chance success, but not 
boys ' I 
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Black/White Differences 



About 60^ of children in the first grade classes are black. Two 
teacher are white, two black. 



Differences in First Marks 
Reading 

When marks are given for the first time, there is no association 
between the child's race and his mark in reading. The proportion receiving 
various marks is roughly matched to the racial division. In particular 
the numbers of blacks and whites receiving the lowest mark exactly 
matches the margins on race. 

Arithmetic 

Arithmetic marks show some tendency toward association with race. 
Half of the highest marks assigned (2's) go to whites (who coinpose kO% of 
the students). Over 70^6 of the k'a assigned go to blacks (who con5)ose 
5% of the students). There is about a lOff, bias evident at the two ends 
of the marking scale, with a tendency foy whites to get the higher marks 
and blacks to get the lower marks. A x test of association based only 
on the two extreme marks (N = 65) gives the value 2,99, significant 
only beyond the .10 level. 



Conduct 

The assignment of marks in conduct shows an even stranger racial 
bias of the same sort, for 3% of the I's are given to whites and l&jl, of 
the lowest marks (3's) are given to blacks. (No k's are given to anyone.) 
There is a 3.5-18^ bias at the two ends of the scale. A X test performed 
only on the extreme cells gives a value of k.jk, which is significant beyo 
the % level. ' 



Marks at End of Grade One 
Reading 

At the end of the first-grade year, marks in reading begin to show 
an association with race not present earlier. There are more 2's given 
to whites and more 3's and U's given to blacks than the margins predict. 
The association between leve^ of mark and race, however, does not differ 
significantly from chance (X3 = 6.27, P < .10). Another way to see the 
type of association by race that may be developing here, is to note that 
the agreement between marks given on the two occasions for whites is at 
chance level (31^) but for the 69^ manifesting disagreement, all move up . 
For blacks there is a similar upward movement I'or all the unmatched cases 
but the percentage agreement (U3^) significantly exceeds chance (26^, 
z =2.39, P < .05). Teachers are more likely to give a black child the 
same mark he received earlier or a higher mark but they are more likely 
to give a white child only a higher mark. 
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Table 6.^2 
Reading Mark, Time 1 

2 3^ Total Percent 





White 


9 


7 


13 


29 


40.8 


Race 
















Black 


11 


12 


19 


42 


59.2 




Toted 


2Q 


19 


32 


71 






Percent 


28.2 


26.8 


45.1 




100.0 
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Table 6.^3 










Arithmetic Nark, Time 1 








2 


3 


4 


Total 


Percent 




White 


15 


26 


10 


51 


40.8 


^ce 
















JiL&CK 


15 


3k 


25 


74 






Total 


30 


60 


35 


125 






Percent 


24.0 


48.0 


28.0 




100.0 






Table 6.5^ 










Conduct Mark; 


, Time 


1 








1 


2 


3 


Total 


Percent 




White 


16 


31 


4 


51 


40.8 


Race 


Black 


13 


47 


14 


74 


59.2 




Total 


29 


78 


18 


125 






Percent 


23.2 


62.4 


14.4 




100.0 
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Table 6.33 

Heading M&rk, Time 2 



Race 





1 


2 


3 


k 


Total 


Percent 


White 


If 


27 


11 


7 


h9 


39.2 


Black 


7 


25 


28 


16 


76 


60.8 


Total 


U 


52 


39 


23 


125 




Percent 


8.8 


Ul.6 


31.2 


18.U 




100.0 




Table 6.^6 












Arithmetic 


Mark) 


Time 2 






1 


2 


3 


k 


Total 


Percent 


White 


5 


19 


16 


9 




39.2 


Black 


7 


19 


25 


25 


76 


60.8 


Total 


12 


38 


hi 


3»+ 


125 




Percent 


9.6 


30. U 


32.8 


27.2 




100.0 



Table 6.57 

Conduct Mark, Time 2 





1 


2 


3 


Total 


Percent 


White 


26 


17 


6 


h9 


39.2 


Black 


29 


33 


Ik 


76 


60.8 


To\;al 


55 


50 


20 


125 




Percent 


hh.O 


UO.O 


16.0 




100.0 






6 - 56 


< .1 


-* 





ERIC 



Arithmetic 



The picture changes little from the picture at midyear, with again 
whites getting more 2's (about 10^) ar.d blacks getting more U's (about 
10^) than would be predicted form the margins. The distributions of I's 
and 3's however is almost exactly what the marginals would predict so an 
overall X test shows no significant association (X3 - ^'l?) N.S.). 
Ttiere is a significant matching of later marks with earlier marks for both 
races (3%, z = 3.^9, P < .01 for whites; 53^, z 3.90, p < .01 for 
blAClun), and also a significant tendency for marks to improve in both 
races. Thirty four per cent of whites get higher marks in arithmetic at 
the end of the year than in the middle (X? = 5.76, p < .05) and 33% of 
blacks do so (Xj = ^.97» P < .05). The lecture in terms of movement 
or change over the year in arithmetic looks very similar for whites 
end blacks. 



Conduct 

At the end of the year a f|w more I's go to whites and 3*8 to blacks 
than one would predlcl:, but a X test performed on the extremes of the 
table is not significant (X^ = 1,1k, N.S.). Earlier it was significant. 
There is a tendency for marks in conduct of both blacks and whites, if 
the marks cahnge, to change upward but this reaches conventional signifi- 
cance levels for blacks only (X^ S.OU, N.S. for whites; X? = 5.02, p < 
.05 for blacks). There is more agreement (no change) for whites than 
would be predicted by chance (55^ agreement vs. 39^ predicted, z ~ 2.21, 
p < .05). There is not significantly more agreement for blacks (46^ agree- 
ment vs. 38^ predicted) . To sum up for conduct, at midyear more of the 
lowest marks (compared to highest marks) went to blacks. At the end of 
the year both groups have moved up (significantly in the case of blacks) 
but relatively fewer blacks remained at the same level. About the 
seme percentage of blacks get 3's on each occasion (19^ initially, l8^ 
at year end), but a few more whites get 3's at the end (12^) than in the 
middle (6^) of the year. 

To sum uP i marks in general show no association with race at either 
midyear or year end. In several places whites seem to be disproportionately 
assigned high marks and blacks low marks but this only attains significance 
foi' midyear conduct. The pattern of changes in marks over the first year 
(as indicated by asymmetry) are the same for both blacks and whites in 
all areas. All asymmetry is of the type where year end mark exceeds 
midyear mark. There are differences by race in the matching of marks at 
the two times. Only blacks show significant mark matching over the year 
in reading while only whites show significant mark matching in conduct. 



Race and Children's Expectations 
Before the First Report Card 

There is almost no difference by race in expectations for reading. 
Most of both races expect the highest mark, 80^ of whites and 81^ of blacks 
have this expectation. In arithmetic there is ali;o Httlc difference in 
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expectations by race although the few children (^$) who do have low 
esw-pectations are all white. One-hundred per cent of the blacks look for 
an A or B in arithmetic. The two distributions of conduct es^ectations 
by race are also almost identical, with virtually all (92^) in both groups 
looking for an A. Of the 8^ who look for less than an A, all expect B's. 

In 8\m: black and white children do not differ in their expectations. 



How Do Initial Expectations Conform 
with First Marks for Blacks and Whites? 



Reading 

For both racial groups the agreement between guesses of marks 
received is within the range of chance e:^ectancy for whites and 7% 
for blacks). For white children in 20 out of 21 cases, the mark received 
is lower than that expected. For black children 3^ out of 38 children 
receive a lower mark than expected. The most noticeable discrepancy for 
both races is the large number expecting I's who get k'a, 43^ of whites 
and 32^ of blacks. 



Arithmetic 

There is not much agreement between the arithmetic mark e:Q>ected 
and the mark received for whites or blacks. The agreement for whites 
is 10^ with 19^ predicted, which is within the range of chance. For blacks 
it is 11^ with 1% predicted— again within the rcmge of chance. About 10^ 
of whites guess they will receive a 3 or U, but no black child looks for 
less theui a 2 in arithmetic . In both groups most children are looking for 
a higher mark than they receive. Teachers gave whites better marks than 
blacks in eurithmetic, 20^ of wfiites vs. 33^ of blacks receive 4's, but 
this difference is not statistically signlflccuit. 



Conduct 

In conduct the picture is uninteresting, again because there is no 
variance in children's expectations. Ninety-three per cent of whites and 
92^ of blacks look for the highest possible mark. Teachers use more high 
marks in this area them in reading or arithmetic, giving 9% of the 
whites I's or 2's emd 86^ of the blacks I's or 2's. The difference between 
assignment of high and low marks to blacks and whites is significant at 
the .0^ level using Fisher's exact test. 

In sum « in all areas for both races only a small amount of matching 
between marks and expectations is e3Q>ected by chemce due to the skewed 
marginals (marks being low, expectations high) and the observed matches 
do not differ significantly from these low expected values. The skewed 
marginals produce a strong asymmetry (expectations exceeding marks) in . 
all arep.s for both races. 
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How Consistent Are Expectations Between the Middle 
and the End of First Grade for Blacks and Whites? 



Reading 

For children of both races the consistency between midyear and end 
of year readf.ng expectations Is about 6% (62^ for whites cuid 73% for 
blacks). This consistency, however, Is not significant since It Is 
forced by the lack of variance In children's guesses on both occasions. 
(67^ agreement for whites and 66^ for blacks would be expected by chance.) 
There Is about equal movement in both races to Increase or decrease 
expectations from one time to the next. The probable lack of Intelligibility 
to children or parents resulting from the complex narking system In 
reading, discussed earlier. Is relevant also In accounting for the high 
consistency of expectations. No events have occurred to disrupt consis- 
tently high esgpectations. 



Arithmetic 

There is UO^ agreement in e:gpectatlons for arithmetic between the 
middle and end of first grade for white children. This agreement is 
exactly whet chance predicts. There is h2^ agreement for blacks, with 
kl% predicted by chance. Children's expectations In both groups move 
up and down about equally during the first-grade year, but the movements 
in black children ' s es^ectations are more noticeable because none of 
them esgpected less than a 2 at the first interview. At the second 
interview 17^ expect a 3 or 4. About of white children on first 
interview expect a 3 or 4 and 11^ on second interview expect a 3 or U. 
Tiie numbers involved are too small to warrant testing. 



Conduct 

For both races, there is little variance, but both groups tend to 
modify their high hopes to some degree. At midyear 9^ of whites expect 
a "1", and 83^ persist in this hope at the end of first grade. At midyear 
93^ of blacks expect a "1", and 8o^ have this e^epectation at the end of 
the year. Matching is high but not significant. 

In sum ; blacks and whites behave almost Identically in terms of 
expectations compared over the first year . Highly skewed marginals result 
in high chance agreements between expectations at midyear and year end. 
Observed levels of matching do not differ from those expected by chance. 



Racial Differences in the Influence of Marks on E^cpectatlons 
Reading 

Both races remain overly optimistic about reading marks after receiving 
low marks at midyear. The percentage of children holding a year end exp^c- 
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tation that is two or more points above their midyear mark is 5^*^ for 
whites and 68^t for blacks. Since white children received slightly (not 
significantly) higher marks at midyear these percentages seem quite in 
line with a statement that whites and blacks are identical in their 
adjustment of expectations to earlier marks, (The magnitude of the mark 
discrepeuice between the races was such as to give whites about 10^ more 
2's (the top mark assigned) and 6^ fewer k'a than blacks,) 



Arithmetic 

Results here again appear about the same for children of both races. 
Thirty-one per cent of whites and kdji, of blacks hold a year end es^pectation 
2 or more points above their midyear mark. Again, however, whites had 
received slightly (not significantly) better marks at midyear. They 
received 13% fewer J|'s and 11^ more 2's (the highest mark assigned) than 
did the blacks. This marking discrepance may not be enough to completely 
offset the observed over-optimism difference. If it is not— which is hard 
to jud«e due to the small N's involved—blacks would seem to exhibit more 
over-optimism (less realism) than whites. 



Conduct 

Teachers give both races mostly 2's in conduct {6k% for whites, 
66% for blacks) but blacks receive 15?t 3's and 19?t I's coinpared to whites' 
h% and 31^ respectively. Both races look for I's at year end {82f% for 
whites, BOf) for blacks). Blacks are over-optimistic at year end in 15^6 
of the cases while whites are over-optimistic in only k% of the cases. 
It seems the differences in assigned marks are sufficient to account 
for the differences in over-optimism between the races— but again the N 
is small and conclusions are only tentative. 

To sum up ; blacks consistently display a greater probability of 
over-optimism at year end (i,e,, year end expectations exceed midyear 
mark by 2 or more points) but this difference is largely accounted for by 
the fact that blacks generally received slightly more low marks, (At 
year end about 80% of children expect I's in reading and conduct, and kO% 
expect I's in arithmetic, for both races,) The most likely conclusion is 
that the races do not differ in their response to first marks - - but the N's 
involved are small and as more data become available the black children's 
disproportionate over-optimism nay attain significance . 

Matching, which in this case indicates the adopting of a midyear mark 
as a year end expectation, occurs at chance levels in all areas for both 
races. 



Teacher Differences 



Fortunately two of the first-grade teachers were white and two black. 
This permits some limited comparisons in terms of same-race and cross-race 
effects between teachers and students, and also allows some inter- teacher 
comparisons. 
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Differences In First Marks 



Reading 

One black teacher and one white teacher assign approximately equal 
nmbers of 2*8, 3's, and U's. The second white teacher gave no marks In 
reading at the time of the first report card. The second black teacher 
narked only about half the class and those who received a mark tended 
to get the lowest mark* 

Arithmetic 

The marks given in arithmetic vary considerably among teachers. Two 
white teachers have distributions that are not widely different 17f 8 
and 9f l6, 10, for 2's, 3's, and U's respectively). One black teacher 
marks harder than all the other teachers, and gave a "4" to slightly 
lean than half the class. The other black teacher assigned the most 2's 
and 3's of all the teachers. Thus the two black teachers bracket the 
white teachers, one being noticeably easier and one noticeably harder 
than the white teachers. The difference in mark distributions (11, 13t ^> 
and 7f 12, 13) between the two black teachers is significant (Xj " 7*36, 
P < .05). 

Conduct 

In conduct the black teacher who was an "easy" marker in arithmetic, 
used one category (2) almost exclusively. The other black teacher's 
dljtributlon closely resembles that for the two white teachers. All 
give rhout the same proportion {1%) of 3's (the lowest mark used), a few 
more I's (about 30^) and the majority 2's (about 531^). 

In sum while a few teacher differences in marking practices appear 
(botn in severity of marking and range of marks used) the data do not 
.ihow strong differences in marking practices by race of teacher. There 
are as yet too Tew cases for firm conclusions, however. 



Children's Expectations at Midyear 

There is not much variance among classrooms in tenrs of children's 
reading expectation, for as noted in other analyses, a very large percentage 
(76^) expect the highest mark. There is slightly less optimism in the 
ulass of the black teacher who turns out to be most lenient in marking— 
63^ of the children in her group expect the highest mark coispared to an 
average of 83^ in the other three rooms. It may be iiaportant for a teacher 
to convey "realistic" expectations to children, thus avoiding serious 
disappointment as this tearher does. 

In arithmetic children's expectations are more modesb, 3^ looking 
for the highest mark. About 60^ of children e:Q}ect a "2". These 
percentages do not vary much across classrooms, or by race- of -teacher, 
as the table below indicates: « ' 
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Arithmetic Nftrk Expected, Time 1 







_1 




-2 


Race of 


Black 


16 


29 


3 


Teacher 


White 


18 


31 


k 



All children held high expectations for conduct and there is little 
variation from one classroom to the next* 



Children's Expectations at the End of First Grade 

For three teachers the distributions of children's expectations in 
reading look similar on both first and second elicitation. The fourth 
teacher, who is white, has most of the children in the entire grade {3 of 
7) who have low expectations (for a 3 or 4) at the end of the year. At 
midyear no children in her class expected a 3 or a Interestingly in 
this class the regular teacher was absent a large fraction of the year. 
The number of children involved however, is too small to be more than 
suggestive. 



Changes in the Number of Children Expressing^ Different Arithmetic Expec- 
tations from First to Second Elicitation 



Esqpeetatlon 



Teacher 


Race 


1 


? 


3 


k 


1 


(w) 


5 


-3 


3 


1 


2 


(w) 


8 


-10 


1 


0 


3 


(b) 


0 


-5 


1 


1 


k 


(b) 


-2 


-1* 


6 


0 



The table above summarizes movement in children's arithmetic 
expectations over the first grade. It shows mainly an increase in the 
number of children expecting I's in the classrooms of white teachers, emd 
a decrease in the number of children expecting I's and 2's from the black 
teachers. As already noted, one black teacher (3) assigns more high marks 
than either white teacher and one (U) assigns fewer. The one who assigned 
lower marks accounts for almost one half of the downward movement in 
expectations. 

There is some downward shifting in conduct expectations for all 
teachers with perhaps a slightly stronger trend for the white teachers. 
Again, however, a very high proportion (80^) expect the highest mark so 
there is little variance to analyze. 

In sum while there appear to be a few differences in the distributions 
of expectations for pupils of the different teachers at both midyear and 
year end these do not seem to be strong differences associated with the 
riice of teachers. 
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Teacher Differences in Children's Expectations and Marks by Child's Race 



White teachers Induce slightly (but not significantly; - ^»h^» N.S. ) 
higher initial expectations for reading in children of both races ^,lk for 
white children and 1.0? for black children) than black teachers (1.32 for 
white students and l.hl for black students). In arithmetic and conduct 
teachers of both races appear to arouse escpectatlons of about the same 
level in both races of children (white teachers for white children— 1.86 
in arltlunetlc and 1.10 in conduct; white teachers for black children— 
1.70 in arithmetic and 1.00 in conduct; black teachers for white children— 
1.7^ in arithmetic and 1.03 in conduct; black teachers for black children— 
1.62 in arithmetic and 1.?.3 in conduct). 

So few children possess low expectations for reading at the end of 
first grade year (only 6f> of all children) that racial comparisons are not 
warranted. In arithmetic children with low e3q>ectatlons at the end of 
first grade are slightly more numerous (17^) but there is not a significant 
difference associated with race. In conduct all children but Z% expect 
I's or 2's (mostly I's) so no racial corqparlsons can be made. 

Numbers are small when double-race comparisons on marks are attempted. 
One overall pattern of double- race comparisons, however, emerges despite 
a few minor deviations. The general trend is for no interaction between 
race of child £uid race of teacher to appear. Black and white teachers 
treat black students similarly. Black and white teachers treat white 
students siiailarly. At times black students are treated differently than 
white students (e.g., in year end reading blacks tend to get lower marks) 
but the lower marks for blacks are assigned by both black and white teachers. 

In sum : comparison of marks and estpectations at both midyear and 
year end within categories of both teacher race and child race shows no 
interaction effects. No special effects on either marks or expectations 
seem to result from children participating in classrooms where the 
teacher's race is the same as the child's race, or where the teacher's 
race is different than the child's race. Small N's are Involved in the 
individual tables but the consistency of the pattern suggests the general 
conclusion is likely to hold as more cases are obtained. 
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CHAPTER 7 
DISCUSSION 



Tlie analyses and results presented in Chapters 3 through 6 are 
lengthy and are also at times fairly coniplex, as when changes in perfozn- 
ance or escpec tat ions are studied as a function of several predictor 
variables or as a function of prior differences in predictor variables. 
There are, nevertheless, many additional emalyses not reported here or 
even yet undertaken because of time limitations, analyses dealing with 
self-esteem, sociometric standing, absences from school, and others. 
Tlie writers tried to deal first with analyses which seemed most cracial 
to the causal interpretation of variability in children's expectations 
and marks. The reader is warned that subsequent analyses may lead to 
some revisions in the preliminary findings presented here. 

Data for the middle-class school aggregate two successive first- 
grade classes (Cohort 1 emd Cohort ?) and so allow fairly extensive 
analyses. More detailed breakdowns by sex, however, cannot be carried 
out even with two cohorts aggregated and must await addition of more 
cases by a future cohort. 

Data for the lower-class school are more restricted, being based on 
a single cohort of first graders. Since all black'^white comparisons stem 
from this cohort, the reader should bear in mind that the findings with 
respect to racial differences are both limited and tentative. 

So far all performance data for children are based on teacher's 
iuarks in reading, arithmetic and conduct. Attempts to aBsess "Language" 
and "Spelling" with Cohort 1 in the middle-class school proved unworkable 
because the children were unable to understand what these terms meant. 
These topics were therefore dropped. In second grade and later, standard- 
ized test scores will be studied as well as the influence of e^^ctation 
variables on standardized achievement measures. 



School Differences 



Several factors differ between the two schools besides social class. 
These factors are therefore confounded with class differences. To give 
some notion of population differences in the two communities served by 
the middle- and lower-class schools, utila from the 1970 U. S. Census for 
census tracts served by the schools are given in Table 7.1. The lower- 
class school serves most of four tracts, the middle-class school most of 
three. In terms of median of head of household education, the socied 
class composition of the two schools represents the customary definitions 
of middle class (some college) and lower class (high school or less). 
Income in the middle-class commonly is over twice that of the lower-class 
community. 



I 



ON 



UN 

SI 



lA 



CO 



H 
CM 



09 
CO 

g 



CO 



CVI 



00 



CM 
Q 



CO 



UN 
VO 



00 



f 

CO 



VO 



I 



UN 



VO 



• 

00 



2P 



ITS 



CM 
Q 



CO 
CO 



VO 



CO 

VO 



•I 



8 

•s 

CO 

CO 

I 

•H 



I 



o 

CO 



0\ 



VO 



5^ 



5? 



CO 



00 



o 



o 



• 

o 



CM 



CM 



CO 



o 

ERIC 



4^ 

I 



0) 

CO 4^ 



« 8 

CO 



7-2 



I 

0) CO 
R 0) 

«i 

§ o 

•H O 
tJ O 
0) 

s 



0) 

o 



9 

O 0) 

fl * ^ 

S 09 O 



The middle-class school is white segregated, its regular staff is 
entirely white, its classes have no aides. The school building was 
recently enlarged and renovated. It has a large library, a cafeteria 
where hot food is prepared on site at noon, and a large playground. It 
is located in a quiet residential neighborhood of well-landscaped 
individual homes worth $30,000-;|(30,CX}0 at current prices. 

The lower-class school has about SO'jb black children, an integrated 
(roughly ^-30) staff, an aide in all kindergarten and first grade classes. 
The school is set on a busy street corner in a densely populated urban 
residential area where traffic is very heavy. Nearby are "row houses" 
worth $6,000 to $20,000 depending on exact location and internal condition, 
and many have been turned into apartments for 3 or more families. Parts 
of the school building were erected before the turn of the century. The 
school has its own playground with much play equipment axid also a large 
gymnasium within the building and a large library. Hot lunches are 
delivered daily to the school. Many children in this school have family 
incomes low enough to qualify for free lunches under a federal program 
as Table 7*1 suggests. The school has the services of a social worker 
two days a week, a full-time guidance counselor, and "resource teachers" 
who help classroom teachers v/lth reading instruction. 

The two schools are very close to the same size with 3 to U first- 
grade classes in each depending on total enrollment. Staff quality in 
both schools is high and in both places there are many teachers who have 
tauf^ht at the school for several years. If anything, formal credentials 
of the staff in the lower-class school may be superior. Class size is 
about the same in the two schools. 

Both schools have kindergartens with half-day sessions attended by 
many of the children who later enter first grade. Very little study of 
these kindergartens has been s;ade so far, but one difference between 
schools very important for this research, is that the lower-class school 
gives report cards to kindergarteners. We atten^ted to query kindergar- 
teners about their expectations for report cards but were unable to 
procure responses we felt were valid. In many cases the child would not 
respond at all; in other cases it was clear the child did not understand 
the nature of the task. Thus the meaning of "first report card" may not 
be exactly the same from one school to the other. 

The reader must be cautious in attributing differences to social 
class or to intergration effects when there are so many other factors which 
also differ between schools. Fortunately with some notion of inter-cohort 
differences available for the middle-class school, one has a benchmark to 
guu/{e the size of intra-school variability. 



I.Q. Differences 



There is a significant yet relatively small difference between 
average I.Q. 's (PMA) of blacks and whites in the lower-class school (4.8 
points— 106.2 for whites and 101.4 for blacks). The I.Q. differences 



between schools is also small, 115 for the middle-class school and 103 in 
the lower-class school. These two averages are significeuitly different 
but in second grade at the middle- class school (the only second grade for 
which there are data) the average I.Q. for Cohort 1 turns out to be lOk 
on the Stanford Test of Mental Maturity. School- to- school differences, 
then are no larger than year-to-year differences between repeated tests 
of (approximately) the same children in a single school. 

r.Q. is a poor predictor of school performance in the early grades 
in the middle-class school. In first grade the highest correlation there 
is around .20 at midyear. A correlation of this magnitude ii^plies that 
only about ^ of the variance in performance is explained by I.Q. differ- 
ences. Correlations at the end of the year between I.Q. and reading, 
arithmetic, or conduct diminish and are non- significant in evexy case. 

The same pattern of correlations with I.Q. appears in second grade, 
with small but significant correlations at mLdye&r giving way to non- 
significant correlations at the end of first grade. Low correlations 
could be attributed to the relatively low reliability of the I.Q. measures 
(the year-to-year correlation between the different tests is .655), the 
low reliability of teacher's marks, or the relative restrictici in range 
on both variables. Since, however, teachers in the middle-class school 
are supposed to assign marks with ability peu'tialled out (i.e., by marking 
each child according to how his performance relates to his own ability), 
the small or non- significant correlations may also demonstrate that the 
teachers have successfully implemented the marking policies. 

Correlations between I.Q. and marks are much higher in the lower- 
class school and remain significant throughout the year. For blacks, 
midyear correlations between I.Q. and marks are considerably higher (by 
about 0.20) than for whites, but they are not significantly higher. 
There are no significant correlations between I.Q. and conduct for blacks 
at either time ( + .l6 and +.01) but all other I.Q. -Mark correlations are 
significant. The only statistically significant difference in I.Q. vs. 
Mark correlations by race occurs for conduct marks at the end of grade one 
(.01 for blacks vs. ,kl for whites). 

The differences between sizes of correlations from school to school 
are striking. The pattern indicates that teachers are effectively 
implementing the different marking policies in the two schools. In the 
lower-class school children are supposed to be marked with reference to 
grade- level performance, and I.Q. is the strongest single predictor of this. 
Therefore a high correlation between I.Q. and first marks indicates the 
validity of the teacher's marks given that marking policy. 

The effects on children are quite another matter. Average I.Q. 's 
are not very different in the two schools, but in the middle-class school 
a child is marked with I.Q. partialled out. As a consequence children 
can get different rewards in the two places for the same effort. If the 
middle-class child tries hard, he gets an "A". If the lower-class child 
tries hard, he still may not get an "A" unless he also happens to be bright. 
The two policies with respect to marking could lead children to form very 
different perceptions of efficacy. Another implication of marking policies 
is that in the lower-class school if the teacher is inept and the child 
reads below grade- level, the child pays an immediate penalty in terms of 



a poor nark. In the middle-class school, an inept teacher may have less 
damaging effects because the child is not marked so much in terms of his 
performance, at least in the. earliest grades. 

Differences in marking practices of first-grade teachers in the 
middle-class school from one cohort to the next are surprisingly large 
but nowhere near as large as inter school differences in marking practices. 
Full assessment of inter school differences, however, must await measure- 
ment of intercohort variability in the lower-class school as well. 



Children's Expectations 



In all three cohorts (two middle-class, one lower-class) correspond- 
ence between a child's expectations and his marks on his first report card 
is at chance levels. Children apparently have no genuine feeling for what 
marks they will get in any of the three areas. 

On the average children's expectations are too high, especially in 
the lower-class school. A natural question is whether this over- estimation 
should be taken at face value or whether it comes about for some other 
reasons, perhaps because children do not understand the task or because 
they are unable to report ax:curately, or because they are loath to 
acknowledge anything less than high e^ctations to the interviewer. 

Two kinds of supplementary evidence bear on the validity of verbally 
reported e^ectations especially for middle-class children. First, in one 
classroom the expectation sampling procedure was carried out on two 
occasions one week apart. There was a high degree of concordance (r = .76) 
between answers on both occasions. Second, every first-grader in Cohort 2 
of the middle-class school and all first-graders in the lower-class school 
were re- interviewed in de'^ith in June 1973. At that time the interviewer 
did not ask for expectations at all but rather probed children's under- 
standing of report card by asking questions like the following: What are 
report cards? What do "Reading" and "Arithmetic" mean? What do the 
numbers "1", "2", etc. on the report card mean? The purpose was to see 
whether children's understanding of report cards and of the numerical 
grading system was clear enough so the Interviewer thought they could make 
a meaningful response in an "e:q>ectation interview".* 

In the middle-class school report cards are better understood (about 
90% of children seemed to have a very good grasp) than In the lower-class 
school (about 70^ of children seemed to have a good grasp). Both groups 
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At the time of the initial interviews, when children guessed what 
they would get on their first report card children were asked if they 
knew meanings of the relevant terms etc. Only after the interviewer was 
satisfied that the child did understand was the Interview continued. 
Also, of course, the plastic replica of the report card aided in specifying 
and defining the task. This later probing Interview, undertaken by a 
dlfi'erent interviewer, served as a validity check. 



seemed to know what was at issue, however— that report cards were 
evaluations on how well they were doing in school. It is harder for an 
interviewer to get lower-class children to verbalize. As already 
mentioned, reports are issued in kindergarten in the lower-class school 
so these children have had several more report cards by the end of first 
grade than middle-class children, although attempts to get lower-class 
kindergarteners' expectations were not successful. The confusing system 
of reporting reading progress in first grade in the lower-class school has 
already been described. This confusion may have interfered with lower- 
class children's and parents' understanding of the marking system in 
reading. 

In both schools children differentiate between reading and arithmetic 
in giving their expectations. Expectations for reading exceed those for 
arithmetic and there is a significant negative correlation between the two 
areas in both schools. Such differentiation is strong evidence of the 
validity of children's verbea reports. In both schools the negative 
correlation between expectations is in sharp contrast to the positive 
correlation with marks actually awarded in the two areas. 

The average level of expectations was higher in the lower-class than 
in the middle-class school at the middle of first grade in both reading 
and conduct. Expectations for arithmetic were about the same. There was 
no difference between expectations of black children and white children 
in the same (lower-class) school for reading and conduct, but white child- 
ren were less optimistic about arithmetic than black children. 

The significant negative correlation between expectations in reading 
and in arithmetic in both schools strikes us as the strongest evidence 
for the validity of children's verbed reports of thplr expectations. 
Reliability over time, as in the one-week repeat elicitation, is encour- 
aging but could mean merely consistency on two occasions with no real 
understanding on the child's part at either time. Significant agreement 
from midyear to end-of-year can also be taken as an indicant of overtime 
reliability, but again could represent merely a recall and repetition of 
what was said earlier rather than a meaningful report. 

The average over-optimism in children's expectations in the middle- 
class school must be interpreted in light of the considerable variability 
in children's expectations at midyear in first grade. A large percentage 
of children are over - optimistic but a significant percentage are also over- 
pessimistic, especially before the first report card. The variability in 
children's expectations is larger than variability in parents' expecta- 
tions cr in marks awarded by teachers. 

The variability of children's initial expectations in the lower-class 
school is smaller than the variability of both marks and parents' expec- 
tations. This relatively small variability in children's expectations 
results from the extreme skewness (toward the highest mark) that is 
present in the children's expectations. By the end of first grade more 
cases of under- optimism begin to appear as expectations decline but 
expectations still remain, on the average, well above both marks and 
parents' exgpec tat ions. 



As became clear from further analyses, the level of a child's 
expectations per se is relatively uninformative in predicting future 
change. Rather the discrepance between the child expectations and rated 
performance is what accounts for change. 

The observed lack of racial differences in expectation levels is 
surprising in some ways but not surprising in others. One might expect 
small differences between racial groups because the children are still 
very young. One might also look for small differences, as we did, because 
this school has been integrated for a number of years and in considerable 
prior research on other topics in this same school we have found black 
children euid white children to be very similar on other measures (rate of 
volunteering and the like). When black children are inducted into new or 
strange situations with white children— as when say children are moved 
abruptly from a segregated to an integrated setting for experiments-- 
blacks may give evidence of low escpectations for themselves (see Cohen 
and Roper, 1973). These low es^ectations perhaps reflect the low expec- 
tations whites hold for blacks in unfamiliar situations, or the relatively 
more threatening nature of universities and scientists as perceived by 
black children. The observations for this research were made in a 
naturalistic rather than e;cperimental setting, and were rather unobtrusive. 



Expectations Over Time 



Since expectations generally exceeded marks received at midyear in 
both schools, one would think that expectations would decrease over the 
first-grade year. Quite the opposite happens in the middle-class school. 
Average expectation levels are higher at the end of first grade than at 
midyear. In the lower-class school expectations generally stayed at the 
same level, with only a moderate decline for conduct— it still maintained 
the highest average expectation of the three areas. In the lower-class 
school there is no difference, furthermore, between blacks and whites in 
how expectations change over the first-grade year. 

Middle-clapR children get better at anticipating their marks over 
the first-grade year. In particular they get better at not under-estima- 
ting. There is highly significant agreement between children's expecta- 
tions and the marks they receive in reading at the end of the year (not 
at midyear). For both arithmetic and conduct agreement approaches 
significGuice only at the end of the year. Children seem to learn quickly 
that teachers give no one the lowest mark and give few children the next- 
to- lowest, so children whose expectations were low at midyear modify their 
hopes upward. Children whose expectations were too high at midyear are 
less apt to modify them downward. 

Of those middle-class children whose expectations and narks disagree, 
the majority have expectations that are too high in reading and conduct, 
but not in arithmetic. Arithmetic Sfjems to be a topic that generates 
imcertainty (anxiety?) in first-gradsrs. Teachers do mark harder in it 
than in other subjects. 



Lower-class children do not show significant oatching of marks and 
expectations in any of the three areas at midyear or at the year end. 
Their high escpectat ions— much higher than middle-class children's— are 
not modified over the year despite hard marking by teachers— almost 50^ 
get very low marks in reading on the first report card. The most prevalent 
finding is that esgpectations remain constant and since most children 
start with high expectations, they are high also at year's end. Consider- 
able negative feedback seems to be ignored for lower-class children's 
expectations, if anything, increase rather than decrease, l^is is 
perplexing. One explanation might be that the children do not understand 
what they are doing when they report thair e3q>ectations to the inter- 
viewers. Both the end-of-year depth interviews and the negative associa- 
tion between e3q)ectations in reading and arithmetic, however, argue 
against this simple explanation. Also, results to be discussed in greater 
detail below show that particular children whose marks increase are 
recruited from those who did worse than they e^gpected at midyear, and 
their performance rises to equal expectations. Thus for certain children 
there is a causal role played by e3q>ectations. 

The interpretation of the paradox may lie in the Impact of day-to-day 
events rather them the impact of a relatively infrequent event like 
receipt of a report card. Reading marks were hard to understand (even 
for the researchers) and reports are given only twice, in Nov^ber and 
March. The way reading is taught j though, leads to a high level of 
individual reinforcement in the classroom administered by both the teacher 
and the aide. Each spends part of every day in small group sessions with 
children carrying out exercises related to reading. Children also spend 
part of every day doing individual written exercises related to reading. 
In these exercises there is a high rate of positive response by the adults 
and comments are given in extremely positive terms even if the child's 
performance has flaws, in order to encourage the child to keep trying. 
The uso of continuous positive feedback, sometimes non-verbal in the form 
of ralcins or M & M candies, is a characteristic of many of the reading 
programs developed for use with low achievers in recent years. In fact, 
children who are relatively poor performers may receive more encouragement 
than better performers, because they get so little self-generated rein- 
forcement from their success at academic tasks that the teacher may need 
to subsidize them to keep them working. (This possible differential 
positive reinforcement of poor performers might be invoked as an alternate 
explanation of why those children whose expectations exceed their marks 
show a disproportionate likelihood of raising marks but it, unlike an 
expectation explanation, is unable to e^^lain the other mark phenomena 
reported on page 5 - 38 and discussed later in this chapter.) 

The upshot is that poor-performing children are receiving strong 
positive feedback face-to-face day after day which contrasts sharply 
with the negative feedback they get in written form on two occasions 
during the school year. Persons of any age would be inclined to attend 
to massive positive feedback especially when the positive feedback is 
direct, immediate, and unambiguous, and to discount rare pieces of 
negative feedback. The child's expectations, therefore, may be premised 
on what he perceives as tho everyday behavior of his teacher and little 
affected by the infrequent and ambiguous written reports. Lower-class 
children are probably also less able than middle-class children to 



appreciate linguistic usages involving mitigating terms (Bernstein^ B.^ 
1970) so that the verbal comments teachers make may have a more positive 
impact than the teacher really intends. 

It is hard to imagine how a young child can be brought to learn 
without considerable encouragement. The middle-class childi however ^ may 
be better at encouraging himself when he starts school. For one things 
his lower expectation level will let him experience subjectively-defined 
success more often. If a child has very high expectations and his per- 
formance does not equal them^ the contrast leads to negative reinforce- 
ment. 

It may be dangerous to link heavy encouragement in the classroom with 
rigid marking practices as is done in the lower-class school. At some 
point a child must experience severe dissonance when he contrasts his high 
subjective ideas of his own performance^ reinforced by face-to-face 
interaction with the teacher ^ with the low marks he gets on report cards. 
In first grade dissonance may be minimal because marking practices are 
obscure. In later grades such dissonance is less equivocal— imagine the 
feelings of the child who is strongly encouraged through the first three 
years of school when he reviews his report cards and it dawns on him 
that all his marks have been low or failing. 



Marks 



In both schools there is significemt matching between marks in reading 
and arithmetic at both midyear and year end. In the middle-class school 
conduct marks show significant matching with both reading and arithmetic 
marks at all times during the first grade but the percentage of cases 
matching in these areas declines from midyear to year end. The lower- 
class school generally shows no relationship between conduct marks and 
the academic areas , an exception being arithmetic at year end where a 
low level of significance is attained. 

This general pattern is not too surprising. Marks in the academic 
areas might be expected to be related whether marking was done in terms 
of grade norms (as in the lower-class school) or in terms of "effort" 
(as in the middle-class school). Marking in terms of grade norms should 
produce conduct marks that are reasonably independent of academic marks— 
and this happens in the lower-class school. Marking the academic areas 
in terms of effort might be expected to initially produce marks that show 
a relation to conduct marks as the various behavioral manifestations of 
effort available to teachers for marking purposes seem closely related 
to classroom behavior or children's general attempts to behave. The 
obnerv^d relationship between arithmetic and conduct marks is of a halo 
^•rfoot In raiddle-cluM;: tfi^ichera' nuirklni-:, for there is much more corre- 
::pondence between arithmetic and reading than would be expected with T.^. 
effectively partialled out. It also seems reasonable that as the year 
progresses teachers should show Increased ability to separate children's 
academic efforts from their conduct efforts. 
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The variable correlations between I.Q. and marks from the middle of 
the year to the end of the year and between schools could also directly 
reflect the different marking policies in force at the two schools. In 
the middle-class school marks are supposed to be awarded in terms of the 
child's living up to his ability level whatever that level is. Pragmat- 
ically a measure of teachers' effective Implementation of this policy 
are the weak correlations of T.Q. with midyear marks and the lack of such 
correlations at year's end in both first and second grades at the middle- 
class school. Likewise the significant correlations between I.Q. and 
marks in the lower-class school can be seen as reflecting teachers' success 
at rating children in terms of grade-level performance. Also, of course, 
the relatively greater variability of both I.Q. scores and marks in the 
lower-class school favors the appearance of higher correlations between 
I.Q. and marks there in contrast to the middle^class school. 

The data shed light on marking practices per se. It is, first of 
all, surprising how much difference there is in marking distributions from 
one year to the next in the middle-class school. It is also surprising 
that so many very poor marks (up to 50^t) are given in the lower-class 
school on the first report card. The child who is receiving his or her 
first marks processes this feedback in ways that have effects on performance 
as the data show for middle-class children, and in ways that seem to have 
no effect for lower-class children. This lack of effect could iinply an 
"insulation effect"— when feedback is negative the child insulates himself 
against it. Such insulation may have advantages for preserving the self- 
image but it may be deleterious in terms of monitoring and thereby 
Improving performance. The implications of the two different marking 
systems found in the two schools— one in relation to ability and the other 
in relation to grade norms— may be profound in terms of long-range 
performance. Our data show lower-class children's expectations were nain- 
tulned at very high levels to the end of grade one. It will be Interesting 
to trace the course of these expectations In subsequent years as is 
planned for future work on this same research project. 

A number of comments about marking practices are in order. First, 
teachers are asked to evaluate the student's performance in 27 areas in 
the middle-class school £uid In 22 areas In the lowe.'-class school, surely 
a difficult task with 25 or 30 individual children to be rated. As far 
as first-graders are concerned, our research suggests that most have a 
fair understanding of what Is being rated in reading, arithmetic and 
conduct, but children do not understand other areas, such as spelling. 
Teachers themselves, when queried as to marking practices, show that they 
operate using different criteria and also that they themselves do not have 
very clear cut standards for what a topic like "Language" consists of. 

Teachers treat the sexes much the same at the start of school but 
there are trends for girls to get better marks in reading and conduct by 
trie end of first ^^'ade. 

It ifi impossible Lu tclJ wlmt Wn» overall effect of the lower-class 
school 'b marking practloos will bo, but as already suggested, uniformly 
poor murks may lead Llio ohIJd Lo (IjHrofJiurd marks or to revise his expec- 
tatloni*. drastically (the latter has not happened by the end of first 
Krade^. ^ ^. . 
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In the middle-class school narks are so high that there is little 
room for improvement, especially after final marks are given in grade 
one. Since it has been shovn that improvement in marks is possible if 
expectations exceed them, by creating a ceiling in marks teachers could 
be creating a ceiling for achievements 

For middle-class children, who rather uniformly get high marks at 
first, there may be long-run problems, first in the later grades of 
elementary school when child-to-child comparisons begin and then later 
in Junior high and high school when competition increases. Having 
expectations that are very high can lead to improved performance early 
but it could lead to adjustment problems later when the students finds 
himself up against stiffer coiqpetition. 



Discrepances Between Marks and Expectations 



The main purpose of this research is to uncover causal factors: do 
children's (or parents*) esqpectations causally impact on performance, and 
if so, how? Looking at average expectation levels one gets little feeling 
for how causal effects may occur. The following sections address the 
issue of causal sequences and examine in more detail what happens to 
children sequentially. 

One might think that children with high marks are more astute and 
that they therefore are better at forecasting their marks. However when 
expectation-mark agreement, is stratified by mark received at midyear, 
middle-class children of all performeuice levels seem to formulate their 
expectations in the seune way. Children receiving A's and those receiving 
D*s show approximately the seune distribution of e:g)ectations (as do those 
in between). 

Another reasonable hypothesis might be that children with low expec- 
tations, more than children with high expectations, would manifest a high 
amount of agreement between forecasts and performance because a poor per- 
f romance is under the child's control. But middle-class children who 
forecast low marks, ana they are relatively few in number, get about the 
same percentage of low marks as the class as a whole so this line of rea- 
soning also does not hold. 

For lower-class children the data do not permit a meaningful analysis 
of expectation-mark agreement at first report card in terms of mark or 
expectation level because, as remarked in Chapter 6, the combination of 
exceedingly high child expectations with relatively low mark assignments 
severely restricts possible patterns of outcomes. Also with data for only 
a single cohort the case base is small. 

By the end of first grade in reading there is increased expectation- 
mai'k agreement for middle-class children (increasing from about to 8o^). 
Change in marks and expectations seems related not to simple performance 
level or to simple expectation level, as already stated, but rather to 
the difference between performance and expectations. There are too few 
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data for the lower-class school to allow definitive analyses, but in both 
arithmetic and conduct, those children whose niarks do improve are dispro- 
portionately recruited from among those who earlier showed disagreement 
between mark and expectations. There is a tendency for both marks and 
expectations to change in a way to reduce e^ectation-mark discrepance. 

Feedback effects will now be discussed for the middle-class school 
only. If a child did worse than he expected at midyear, his expectations 
at yearns end tend to remain the same and his mark is brought into line. 
If he did better than he e^qpected, he is very likely to modify his 
expectations upward and keep the mark the same. There is a consistent 
observation as well of the following types of cases: a child's mark 
remaining the same if his mark equalled what he expected, and a child *s 
mark going down if he did better than he expected. 

The majority of children whether their midyear mark exceeded, 
equalled or was below their e^ectation can expect the same mark at year's 
end that they received at midyear, because there is great consistency in 
marks awarded. There is, nevertheless, a consistent (and significant 
for reading and conduct) observation of cases where marks move up, down, 
or remain stationary to agree with expectations. The data suggest that 
students* expectations causally affect the marks they receive even if the 
degree of association is rather low. 

Finding a causal role for expectations is one of the major achievements 
of this research. Only with data aggregated across twc (middle-class) 
cohorts to supply an adequate number of cases for analysis could this 
effect emerge. 

The data shew that a child's cognitive state ^ his estimate of his 
own performance, is instrumental in changing that level of performance. 
Unfortunately the data in the lower-class school are not yet extensive 
enough to allow an investi^/ation of the same sort, but what data are 
there, as already stated, point to the same finding. 

The other side of the coin must not be lost sight of, however. Of 
all the children followed, the most likely outcome is for both expectations 
and marks to remain the same. In the middle-class school where almost all 
children are awarded A's and B's and where expectation levels are high but 
not inordinately high considering the high average performance level, the 
stability of expectations leads to high expectation but not to an unreal- 
istic cognitive state. In the lower-class school, where expectations also 
persist at high levels, if future analyses bear out present tendencies, 
several outcomes are possible. First the child may insulate himself from 
negative feedback — he may block out awareness of, or dissonance over, low 
marks--and so be hindered in monitoring his performance. Second, if there 
are downward shifts in expectations from time to time as the child becomes 
aware of low-level performance, serious limits could be imposed thereafter 
on aspirations. 

We have shown so far that young children's expectations are high, 
resilient, and responsive mainly to positive feedback. More data are 
needed to see whether, if expectations are lowered, they are also resilient 
to change thereafter. 
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Parents' Expectations 



Parents and children do not agree on expectations in either school. 
Parents' expectations in the middle-class school show highly significant 
agreement with teachers' marks in reading at midyear, and with marks in 
arithmetic and conduct both at midyear and end of year. Parents' and 
teachers' marginals tend to correspond in the middle-class school. Parents 
tend to "play it safe", however, and most forecast a '*B" in all areas. 

Parents do seem able to identify those children who will perform 
poorly or very well. One might think that a superior performer would be 
more easily identified, and this is true even at the first report card. 
There are significant trends toward increased agreement between parents 
forecasts and year-end marks in arithmetic and conduct but not in reading. 
The pattern of this increased agreement is consistent with a causal 
hypothesis viewing parents' expectations as the independent variable. 
Parents from one year to the next in the middle-class school show a higher 
degree of consistency in their expectations than that shown by either 
teachers marks or children's expectations. 

When middle-class parents' e3g[)ec tat ions are not correct, they are in 
the direction of under-estimating in reading and arithmetic. There is a 
high correlation between parents' expectations in reading and arithmetic 
but no relation between either of these and conduct. 

Middle-class parents here demonstrate a compartmentalization of 
expectations for their children's school performance that Melvin Kohn (19^9) 
would predict. They do not see classroom behavior— ability to sit still, 
to be docile, to be deferent— as related to intellectual performance* 

In contrast to trends noted for middle-class parents, the parents of 
lower-class children do not have expectations that are at all veridical 
with first marks in reading. They do match marks in reading at the end 
of first grade, however. In the lower-class school the preponderance of 
low marks in reading at midyear is not anticipated by parents. Lower- 
class parents also over-estimate children's performance in arithmetic at 
midyear and agreement with marks in arithmetic is not significant at year's 
end. 

Lower-class parents expect higher marks than their children receive. 
Interestingly, on the average, their expectations are a little lower than 
middle-class parents' in reading and arithmetic but are about equal in 
conduct, 'inhere is a major difference between marking practices in the 
lower-class and middle-class schools as already noted. The act^l 
distributions used by teachers in the middle-class school agree well with 
what both sets of parents anticipate. The low marks awarded in the lower- 
class school lead to parents there over -estimating their child's performance, 
whereac in the middle-class school there is under-estimation, if anything. 

Parents' Involvement with school, or the attention parents give to 
school matters, might be indexed by the levels of response noted in this 
research from the two sezs of parents. In the middle-class school we were 
able to attain a response rate of Q6% using a mail questionnaire when we 



souglit parents' expectations. For Cohort 1 in first grade we were able 
to see $2% of the parents when they visited school during American 
Education week. Using a trained interviewer to contact lower-class 
parents, preceded by a flyer carried home by the child and persisting 
through three call- backs, we were able to secure responses from of 
lower-class parents. When we attempted to interview lower-class parents 
by seeing those who came to school in American Education week, our 
response rate was less than 10^. 

One set of parents thus seemed in close touch with school, the other 
set not. In the middle-class school the PTA actually aided the researchers 
in contacting parents. The PTA in the lower-class school is inactive. 
Communication fluency may be what causes middle-class parents to have 
more realistic expectations for a child's perf ormsuice . By communicating 
with other parents and with teachers, middle-class parents may get enough 
information to be accurate. 



Sex Differences 



Reading marks in both schools do not show a significant association 
with sex at midyear in either school or at year end in the lower-class 
school. By year's end, however, more boys in the middle-class school get 
the poorer marks. At no time is there any significant association between 
marks in arithmetic and sex in either school. 

In conduct there are large sex differences at midyear and at year's 
end favoring girls in the lower-class school. In the middle-class school 
there is a strong trend for the lowest conduct nr&rxs to go to boys. There 
is a slight tendency for middle-class parents to expect poor conduct from 
boys as compared to girls, and surprisingly, lower-class parents expect 
the same performance in conduct for boys and girls. The expectations of 
middle- and lower-class parents for boys and girls are at odds with what 
might have been predicted. 



Racial Differences 



Results in terms of racial breakdowns are not extensive because they 
are based on only one cohort. Some preliminary and tentative findings with 
respect to race are nevertheless worth summarizing. 

In first marks, there is no association between race and marks in 
reading, although both In arithmetic and conduct white children's marks 
tend to be hlger (significantly so for conduct). The differences are very 
small liowever by practical criteria, less than one-quarter of a unit in 
both ureas. 

At the end of first grade there is movement toward whites getting higher 
marks In reading (2.U3 vs. 2.69) but the improvement in marks of both 
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groups over the year is uuoh more striking than the difference between 
them at year's end. Whites' average lji5>rovement is O.71 grade points and 
blacks' average improvement is 0.U7 grade points. In arithmetic and 
conduct the difference by race is about the same as at midyear and only 
slightly favors whites. 

Children of both races expect the same marks. Parents of both races 
have similar eaqpec tat ions in reading and arithmetic but white parents 
expect better conduct marks (I.6I vs. 2.08). Both sets of parents have 
expectations which are not consistent with the marks their children 
receive. There is some limited indicatibn that white children are more 
cognizant of earlier marks in shaping their year-end expectations than 
black children. 

Ivo teachers are white arid two black emd each class has approximately 
kO% white and 60% black children. As far as these data shed light on 
teacher differences, there do not appear to be any differences associated 
with race for teachers. 



Implications and Future Research 



More work is needed both to extend the time span over which particular 
children are followed and also to increase the number of first-graders for 
study. Many more analyses, and more searching analyses in terms of causal 
hypotheses, were possible for middle-class children because two cohorts 
could be aggregated.' An even larger sample is needed for the lower-class 
school because there one wishes to carry out racial breakdowns as well. 
Important measures still missing are standardized test scores for reading 
and arithmetic. (These tests are not given in first grade.) They repre- 
sent a criterion of performance badly needed as a dependent variable in 
our analysis. Teachers' marks are compounded of expectations and judgment 
related to r.Q. and other factors, making such marks of great interest 
from the standpoint of Pygmalion effects or of interpersonal influences, 
but they are not very good barometers of performance. In fact, major 
future interest will attach to the relations between expectations, 
teachers' mark^ and standardized achievement test scores. 

To date some significant findings are available, however. Children 
at the start of school have a great deal of optimism about their ability 
to perform. Their expectations for the most part are high, particularly 
for reading. This suggests that the "protective circle of the family", 
whether middle-class or lower-class, nurtures children who are self- 
confident arid optimistic when they start school. The expectations of 
parents further supports this claim, for both middle-class and lower-class 
parents are on the whole optimistic about their first grader's likely 
success in school, even though lower-class parents are not quite as 
optimistic as middle-class parents. What is somewhat perplexing is how 
lower-class parents can express such confidence if they have older 
children whose performance has been poor, as must often be the case. 
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Parents' expectations are resArkably constant in the middle-class 
school and show significant agreement with children's marks in all three 
areas. Lower-class parents' expectations agree with marks in only one 
instance^ at the end of first grade in reading, after children's reading 
marks have improved noticeably. Data are limited but changes in marks 
in the middle-class school seem more closely attuned to discrepances 
between parents' extpectation and marks than discrepances between children's 
expectations and marks* (The possibility that joint discrepances of the 
same sort by both parents and children are occurring has yet to be studied*) 
Add to this the fact Just noted that lower-class parents' expectations 
match marks at the end of first grade in reading but not at the middle of 
the year, euid the inference is that parents' are influential in cases of 
children whose marks move up. 

There is, however, a question clearly raised at this time which can 
be answered only with future data. It is that movement in marks is 
associated with a prior discrepance between expectations and marks such 
that differences between the two are abolished— they move toward 
consistency. If the middle-class child did better than expected, his 
mark tends to drop. If he did worse than expected his mark tends to rise, 
but even if his mark does not rise his expectations are more likely to 
remain the same than drop. 

In the lower-class school no one did better than he expected. M&rks 
are much lower and there is a significant tendency for children whose 
marks in arithmetic and conduct improve over the year to be disproportion- 
ately recruited from those whose expectations exceeded their first 
semester mark. This statistical significance, however, is linked to an 
effect that is rather small in absolute size, (in reading a similar trend 
is not significant but is worth attention because it is consistent with 
the trends in arithmetic and conduct.) 

Improvement in marks in both schools, then, is related to earlier 
mark expectation discrepances but improvement in marks occurs for only a 
minority of children in either school. Expectations move more easily than 
marks and also in a way to reduce expectation-mark discrepance, but are 
characterized by a bouyancy effect . They tend to rise more easily than they 
tend to fall. Lower-class children 'n expectations do not fall, and their 
expectations are very high to start with. A disequilibrium between marks 
and expectations is noticeable at midyear first grade and persists over 
the year because marking is "hard" and expectations are extremely high. 
What happens as time goes -on? It would be difficult for marks to keep 
improving when children are marked in terms of grade-level norms and yet 
so far there has not been a decline in expectations. Do expectations drop 
later? One would think they would have started to drop already but they 
have not. 

In an earlier review of reading models and language socialization 
one of the authors (Entwlsle, 1971) noted "evidence is accumulating that 
socioeconomic status is a more crucial Influence on reading performance 
than I.Q. " The present report looks not at linguistic factors or cognitive 
habits, as the earlier paper did, but looks at social factors or organ- 
izational (school) variables. The same conclusion can be stated here, 
however, with even more conviction. The average difference in I.Q. is 



not large between schools or between races but reading and arithmetic 
performance by first graders differs by race and school, and all prior 
experience suggests that there will be signif iceuit differences on 
standardized tests in reading and arithmetic between schools when these 
children are tested in second grade. Informal classroom observation reveals 
very large differences in reading proficiency at the end of first grade. 

We intend to investigate the present data set much more thoroughly 
to gain insight into what may be responsible for these differences. 
Parents of lower-class children may be setting themselves up to administer 
negative reinforcement— by having higher expectations than their children 
con deliver they may be boxing themselves into a corner of not being able 
to be pleased with the child's performance. By marking on an absolute 
scale the teachers in the lower-class school may also be removing their 
ability to provide positive reinforcement. If a child is trying hard 
and sees no obvious result, i.e., no high mark, his effort must slack off. 

We suspect that forces are already at work during first grade in 
addition or prior to any reinforcement effects . One of these is the 
effective time the child is coupled into the learning system . We have 
< yet to ancLLyze absence data which may be used as an indicator of "effective 
time". We have noted informally, however, a much higher absentee rate 
in the lower-class school. We also have noted much less of children's 
and teachers' time spent in cognitive activities because children are late 
or not prepared for school (unsuitably dressed and the like). To this 
one can add, although we have as yet no measure of it, the likelihood that 
the lower-class children are coming to school having had a poor breakfast 
or no breakfast at all. We intend to investigate all these possibilities 
as the research continues. 
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